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(57) Abstract: The invention discloses a family of cloning vectors capable of cloning nucleic acid inserts of interest of long sizes, 
with low or reduced background and high efficiency of excision and method for preparing these vectors and library thereof.As ex- 
ample, it is disclosed a cloning vector comprising a construction vector segment (CS) and a replaceable segment (RS), wherein the 
size of CS is: 36.5 kb < CS < 38 kb, preferably CS is 37.5 kb, comprising lox recombination sites for Cre -recombination and/or att 
recombination sites for Gateway-like recombination, preferably also a background-reducing system selected from the group of: the 
ccdB gene, a lox sequence, the lacZ gene, and asymmetric site sequences recognized by restriction endonucleases. 
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DESCRIPTION 

CLONING VECTORS AND METHOD FOR MOLECULAR CLONING 

FIELD OF THE INVENTION 
5 The present invention relates to recombinant DNA technology. In 

particular, it is disclosed a novel cloning vector family and in vitro and in 
vivo method for cloning of nucleic acids of interest. 
BACKGROUND ART 

Efficient genomic and cDNA cloning vectors are important tools in 

10 molecular genetic research, because high quality, representative libraries are 
rich sources for the analysis of many genes. 

Full-length cDNAs are the starting material for the construction of 
the full-length libraries (for example, the RIKEN mouse cDNA encyclopedia, 
RIKEN and Fantom Consortium, "Functional annotation of a full-length 

15 mouse cDNA collection", Nature, February 8, 2001, Vol. 409:685-690). In 
contrast to standard cloning techniques, full-length cDNA cloning has the 
inherent risk of under representation or absence of long clones from the 
libraries, and cDNAs deriving from very long mRNAs are not cloned if the 
capacity of the vector is not sufficient. 

20 Available plasmid cloning vectors show bias for short cDNAs: shorter 

fragments are cloned more efficiently than longer ones when competing 
during ligation and library amplification steps. Although plasmid 
electroporation does not show relevant size bias, during circularization of 
plasmid molecules in the ligation step, in a mixed ligation reaction, short 

25 cDNAs are ligated more efficiently than longer cDNAs (Sambrook et al., 

1989, Cold Spring Harbor Laboratory Press, Molecular Cloning, NY, USA). 
Cloning vectors derived from bacteriophage have been disclosed as 
particularly useful for cloning, propagation of DNAs and for library 

1 
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construction. Ligated mixtures of insert and bacteriophage vector DNAs 
can be efficiently packaged in vitro and introduced into bacteria by infection. 

Bacteriophage vectors allow cloning of cDNAs sequences, however, 
the final product for large-scale sequencing should be a plasmid for large- 
5 scale colony picking, propagation, DNA preparation and sequencing 
reactions (Shibata et al., 2000, Genome Res. 10: 1757-1771). 

Cloning vectors for automatic plasmid excision should have a 
capacity for wide-range cDNA cloning, that is including cDNAs as short as 
0.5 Kb and as long as 15 Kb, which are visible on agarose gel when using 
10 trehalose during the first strand cDNA synthesis (Carninci et al., 1998, Proc. 
Natl ScL USA, 95:520-524). 

There are a number of bacteriophage vectors allowing whole library 
bulk excision, but they are not optimal in terms of cloning size or bulk 
excision protocol. 

15 Examples of plasmid excision from bacteriophage vector having a 

cloned insert were obtained with the X-Zap II (Short et al, 1988, Nucl. Acids 
Res., 16:7853-7600). However, the bulk excision from X-Zap II shows size 
bias towards short inserts when using a mixed sample like a cDNA library, 
which contains both short and long clones. Using X,-Zap II, long and rare 

20 cDNAs are difficult to obtain. 

Other vectors designed for cDNA cloning and plasmid excision like 
the k-Lox derivatives (Palazzolo M. et al., 1990, Gene, 88: 25-36), X-YES 
(Elledge et al., 1991, Proc. Natl Acad. Sci. USA., 88: 1731-5) and 
X-Triplex™ (CLONTECHniques, January 1996), accept cDNAs that do not 

25 exceed 9~10 Kb. Alternatively, vectors for genomic libraries construction 
and Cre-lox mediated plasmid excision accept inserts longer than 7 Kbp, 
such as X PS (Nehls et al., 1994a, Biotechniques, 17: 770-775), XpAn (Holt et 
al., 1993, Gene, 133: 95-97), A,GET (Nehls et al., 1994b, Oncogene, 9: 2169- 
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2175), X-MGU2 (Maruyama and Brenner, 1992, Gene, 120: 135-141) and a 
vector based on Tnl721 excision system, X,RES (Altenbucher, J, 1993, Gene, 
123: 63-68). However, these vectors do not allow the preparation of wide 
range size cDNA libraries. 
5 Only among the XSK series there were some vectors with calculated 

capacity between 0.2 to 15.4 Kb (Zabarovski et al., 1993, Gene, 127: 1-14), 
which would be suitable for wide-range size cDNA cloning purpose. 
Unfortunately, the rudimental excision system of XSK is based on simple 
restriction digestion, which causes internal cleavage of cDNA clones and 
10 probably this is the reason why these vectors are not commonly used for 
cDNA cloning. 

Japanese patent application having publication number P2000- 

325080A, discloses a modified X PS vector. The new vector, indicated with 

the term A,-FLC-1, comprised a 6 kb nucleic acid sequence (stuffer II) in the 
15 left arm of the X PS vector so that the size of the vector, without considering 

the cDNA of interest, was 38 kb. This modified X PS vector was described 

as being able to insert broad range size of cDNAs. 

The X-FLC-l, even if useful for generic (or "standard") large size 

cDNA libraries, still shows a bias for short and not full-length cDNAs, so 
20 that very long, rare and important full-length cDNAs are difficult to obtain, 

in particular, in case of strongly normalized and/or subtracted cDNA 

libraries. 

A further problem in the art refers to the efficiency of bulk excision 
recombination mechanism. 
25 Bulk cDNAs (cDNA library), that is a library of cDNA comprising a 

wide range size of cDNAs, short, medium and long ones, are inserted in 
cloning vectors. These inserts are then transferred in other functional or 
specialized vectors that have desired characteristics, such as expression 
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vectors. This transfer is called subcloning. The functional or specialized 
vectors used for subcloning DNA segments are functionally diverse. These 
include but are not limited to: vectors for expressing genes in various 
organisms; for regulating gene expression; for providing tags to aid in 
5 protein purification or to allow tracking of proteins in cells; for modifying the 
cloned DNA segment (e.g., generating deletions); for the synthesis of probes 
(e.g, riboprobes); for the preparation of templates for DNA sequencing; for 
the identification of protein coding regions; for the fusion of various protein- 
coding regions; to provide large amounts of the DNA of interest, etc. It is 

10 common that a particular investigation will involve subcloning the DNA 
segment of interest into several different specialized vectors. 

Traditional subcloning methods, using restriction enzymes and ligase, 
are time consuming and relatively unreliable. 

The use of recombinase recognition systems using specific 

15 recombinase recognition sequences have been proposed and they are known 
as Cre-lox (Palazzolo et aL, 1990, Gene, 88: 25-36) and Gateway™ (Life 
Technologies Catalogue; Walhout A.J.M., et al., 2000, Methods in enzymology, 
Vol.328: 575-592; and US 5,888,732). 

The Cre-recombinase solid-phase in vivo excision requires infection 

20 of the amplified cDNA library into a bacterial strain, which constitutively 

express the Cre-recombinase, for instance BNN132 (Elledge et aL, 1991, Proc. 
Natl Acad. Set USA, 88: 1731-5). However, this is not recommended because 
of low plasmid yield (Palazzolo et al., 1990, as above) and plasmid instability 
(Summers et al., 1984, Cell, 36: 1097-1103): in fact, Cre-recombinase is 

25 constitutively expressed causing formation of plasmid dimers/multimers 
leading to high proportion of plasmid-free cells (Summers et aL, 1984, as 
above), impairing the sequencing efficiency. 

The Gateway excision is an alternative system to the Cre-lox excision. 
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According to the general Gateway™ system, an insert donor vector carrying 
a DNA of interest (insert) and a pair of recombinant sites different from each 
other, recombines with a donor vector comprising a subcloning vector and a 
pair of recombinant sites different from each other, but able to recombine 
5 with the insert donor vector recombination sites. The final product is a 
subclone product carrying the DNA of interest (insert) and a byproduct. 
The recombinant sites are attB, attP, attL and attR. 

However, the Gateway™ system shows a bias for short cDNA; long 
cDNAs are obtained with low efficiency (Michael A. Brasch, slide "Gateway 

10 cloning of attB-PCR products", GIBCOBRL® Technical Seminar, "Gateway 
Cloning Technology", Life Technologies™, 1999). 

Another further problem in the cloning system consists in the 
presence of background, which is due to environmental DNA contamination 
and to subcloning process byproducts, that is a non recombinant plasmids 

15 (plasmids without the DNA of interest). 

It is instead highly desirable having a background-cutting cloning 
system, able to eliminate completely or having a little background. 

Some background-cutting strategies have been proposed in the art. 
Walhout et al. (as above), for example, reports that the Gateway™ vectors, 

20 attPl-attP and attRl-attR2, also contain between the att sites the ccdB gene 
(Bernard P. and Couturier M., 1992, J. Mol Biol, 226:735-746), whose 
protein product interferes with DNA gyrase. After recombination, only the 
plasmids that have lost the ccdB gene (and which are recombinant) can grow 
in E.coli strains not mutated for gyrA, therefore providing a selective 

25 advantage. 

Plasmids carrying the gene ccdB can propagate only in specific E.coli 
strain, DB3.1, which carries a mutation in gyrA gene conferring resistance to 
ccdB (Walhout et al., as above). Therefore, this kind of recombination is 
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limited to plasmids, since other vectors for instance X substitution vectors 
used in cloning systems cannot grow and replicate in cells like DB3.1, which 
miss the recA protein (the recA product is required for the growth of 
substitution- type bacteriophage hiSambrook et aL, 1989). 
5 In conclusion, there is the need in his field of the art of providing of 

vectors having the characteristics of: i) being size bias free and allowing the 
preparation of "size balanced" comprising very long, rare full-length cDNAs; 
ii) capable of improved recombination mechanism; and iii) able of 
background cutting. 

10 The cloning vectors available in the state of the art, fail to satisfy the 

above characteristics. 

The invention disclosed in the present application is addressed to 

solve the problems in the art. 

SUMMARY OF THE INVENTION 
15 The present inventors provide a new family of vectors capable of 

cloning nucleic acids of wide range size and preferably very long ones, with 

high efficiency of excision and reduced background and contamination. Also 

provided are methods of cloning and for preparing bulk library using such 

vectors. 

20 According to a first embodiment, the invention provides a cloning 

vector comprising a construction vector segment (CS) and a replaceable 
segment (RS), wherein the size of CS is: 36.5 kb <; CS < 38 kb, preferably 
CS is 37.5 kb. The construction vector segment preferably is made or 
comprise a bacteriophage X vector fragment. The replaceable vector 

25 segment (RS) represents the segment, which is replaced by the nucleic acid 
insert of interest, which one intends to clone. 

It has been surprisingly found that a cloning vector with this size is 
capable of preferably inserting cDNA of very long sizes, and it is therefore 
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particularly advantageous for cloning very full-length cDNAs. This vector 
overcomes the problem in the art of existing vector A,-FLC having a 
construction vector segment of 38 kb, which showed a strong bias for short 
size cDNAs (see Table 1). 

The selection of a particular advantageous size of the vector for the 
preparation of full-length cDNAs libraries can also be applied to 
bacteriophage other than A,. Accordingly, the present invention also relates 
to a cloning bacteriophage vector comprising a construction segment (CS) 
and a replaceable segment (RS), wherein the size of CS is: X-1.2 kb <s CS < 
Xkb; X (expressed in kb) corresponding to the minimum size necessary to the 
bacteriophage vector for undergoing packaging. The size of CS is 
preferably: X-0.2 kb. 

The present invention also relates to a bacteriophage vector, 
preferably a X, comprising a bacterial artificial chromosome (pBAC) or a 
segment thereof comprising at least an origin of replication (ori). This 
vector can also comprise: a site into which a DNA fragment can be cloned; 
and a pair of inducible excision- mediating sites defining an excisable 
fragment that comprises the site into which the DNA fragment can be cloned. 
The pair of excision- mediating sites are preferably FRT sites. 

This vector may further comprise an inducible origin of replication, 
preferably oriV. 

The cloning vectors according to the invention are capable of carrying 
out plasmid or nucleic acid insert excision using known recombination 
systems, for example the Cre-lox and/or Gateway™ system. 

The vectors of the invention can also comprise a background- 
reducing system, as ccdB gene, a lox sequence or the lacZ gene or 
asymmetric site sequences recognized by restriction endonuclease. 

The invention also relates to cloning method using the above vectors. 
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According to another embodiment, the invention relates to a system for 
reducing background or contamination by providing a cloning vector 
comprising a backgroung-reducing sequence like ccdB gene and/or a lox 
sequence comprised into RS segment of the vector of the invention, or in case 
5 of the Gateway™ system into the RS segment of a destination or receiving 
vector. RS of phage or plasmid vectors can also be flanked by two 
asymmetric site sequences recognized by restriction endonuclease. 

The invention also relates to a method for reducing background or 
contamination by using these vectors. 
10 The invention also relates to methods for efficient excision of plasmid 

or nucleic acid of interest providing improved Cre-recombinase or Gateway™ 
system using the vectors according to the invention. 

Preferably, the present invention relates to method for the 
preparation of bulk of long or full-length cDNA libraries, by using the vectors 
15 according to the invention. 

The present invention also relates to a kit comprising at least a 
cloning vector or at least a library of vectors according to the invention. 

The present invention further relates to a method for preparing at 
least a normalized and/or subtracted library comprising using a plasmid 
20 vector obtained with the excision method according to the invention or 

destination vector according to the invention, preferably reduced at single 
strand, as normalization and/or subtraction driver. 
BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a general scheme of the vector family according to the 
25 invention. The following functional elements (not in scale) are produced in 
this work. In Fig. 1(a), the functional elements of the vector construction 
segment (CS) are: the left and right arms; the cloning size regulator (or 
stuffer II); a plasmid derivative of pBluescript; and the bulk excision 
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elements (recombination sites) loxP; the size of the construction segment 
(CS) is between 32 and 38.3 kb. The replaceable vector segment (indicated 
as stuffer I or RS) is flanked by the excision Gateway™ elements (attBl and 
attB2); this is the segment that will be replaced by the cDNA. 
5 At the right side of Fig. 1(a), it is shown the mechanism of plasmid 

excision according to the cre-lox system or the excision of cDNA inserts into a 
destination or receiving vector with the Gateway™ system. 

In Figure l(b)-(f) various constructions and sizes of the stuffer I (RS) 
are shown: stuffer I of (b) is 10 Kb as from X-PS vector; (c) is a short version 
10 of the stuffer I to simplify the arms purification; (d) is a 10 Kb stuffer with 4 
ccdB and two LacZ to cut the background; (e) is a 5 Kb stuffer with 2 ccdB 
and one Lac Z; (f) is a stuffer for the ccdB and lox P double background 
cutting. 

In particular, in (g), it is shown a non recombinant plasmid 
15 comprising the ccdB gene which inhibits growth, while LacZ (h) allows color 
selection. In (i) it is shown the background-reducing system using a loxP 
site, which separates the origin of replication and the resistant gene. 
Abbreviations: Sw = Swal, Sf = Sffl, Sp = Spel, Fs = Fsel, Pa = Pad, Xa = 
Xbal. 

20 The Pad, FseL, SfA, Swal, and the cloning sites cut only the sites 

that are shown and do not cut elsewhere in the vectors. 

Figure. 2. Several constructions for vectors according to the 
invention, which are for simplicity indicated with the generic name of A,~FLC 
are shown. 

25 (a) X-FLC-I-B and ^-FLC-I-E, having the stuffer I of Fig. lb and le, 

respectively, (b) k-FLC-I-L-B and X-FLC-I-L-D, which lack the stuffer II 
and have a stuffer I of Fig. lb and Id, respectively, cloning site as in (a), (c) 
X-FLC-II-C carrying the Gateway™ attBl and attB2 sequence for bulk 
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transfer of clones; it has a stuffer I like Fig.lc. (d) X-FLC-III-F having the 
stuffer I like in Fig. If for background reduction, (e) X-FLC-III-L-D which 
lack the stuffer II and has the stuffer I like in Fig.ld. (f) X-FLC-III-S-F, 
having the stuffer I like in Fig.lf but having a longer stuffer II (6.3 Kb). 
5 Vectors (d-e) have sites for homing endonucleases (I-Ceul and PI- 

Seel) next to the cloning site for easy transfer of inserts to other vect6rs; the 
cloning site is shown in (d) only. 

Vectors (g-j) show polylinker sequences which are placed at left and 
right side flanking the stuffer I (indicated in Fig.l(b-f)) or cDNAs (which is 

10 represented by a sequence of asterisks). The underlined sequences into the 
polylinkers represent primers, recombination sites, restriction sites, and the 
like. These restriction sites do not cut elsewhere in the h- vectors or in the 
plasmids at all. More specifically, in pFLC-I, the left polylinker (SEQ ID 
NO:l) comprises: Forward (Fwd) M13 primer site, site for T7 polymerase, 

15 recombination site loxP, restriction sites Sfil and Sail site sequences; the 

right polylinkers (SEQ ID NO:2) comprises: restriction sites BamHI and Sfil, 
site for T3 polymerase, Reverse (Rev) M13 primer site. In pFLC-II, the left 
polylinker (SEQ ID NO:3) comprises: Fwd M13 primer site, T7, attBl, Xhol 
and Sail; the right polylinker (SEQ ID NO:4) comprises: BamHI, attB2, loxP, 

20 T3, Rev M13 primer site. In pFLC-III, the left polylinker (SEQ ID NO:5) 
comprises: Fwd M13 primer site, T3, I-Ceul, Sail; the right polylinker (SEQ 
ID NO:6) comprises: BamHI, Pl-Sce T7, Rev M13 primer site. In pFLC- 
DEST, the left polylinker (SEQ ID NO:7) comprises: Fwd M13 primer site, 
T3, attBl, Xhol, Sail; the right polylinker (SEQ ID NO:8) comprises: BamHI, 

25 attB2, T7, Rev M13 primer site. 

The general pFLC-II of Fig.2h (i.e. without mentioning the specific 
stuffer I or the "insert cDNA") can be constructed by using a modified 
pBluescriptll SK. A general pFLC-II having this construct is shown in 
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Figure 13 and the entire sequence (without stuffer I or "insert cDNA") is 
shown in SEQ ID NO:51. 

Figure 3. Excision protocols. From left to right, in vivo solid phase 
Cre-recombinase (state of the art), in vivo liquid phase Cre-recombinase, in 
5 vitro Cre recombinase. On the right side, the "direct", "indirect", and 

"amplified indirect" protocols, which are mediated by the Gateway™ (GW) 
sequences and enzymes for in vitro excision. 

Figure 4. Average size of obtained cDNA libraries prepared with X- 
Zap II or X-FLC-I-B. 

10 Figure 5. This Figure shows possible vector constructions according 

to the present invention. 

The vector according to the invention can be circular or linear, 
comprising a first segment indicated as construction segment (CS) and a 
second segment indicated as replaceable segment (RS). In linear form the 
15 construction segment (CS) of the vector is represented comprising a left 

segment and a right segment. RS is the segment which will be replaced by 
the nucleic acid insert of interest, for example a full-length cDNA. 

The vector according to the invention can be circular or linear. 
In (a) and (b) recombination sites (here generally indicated as attl 
20 and att2), which do not recombine with each other, flanking RS, according to 
the Gateway™ recombination/excision system (Gateway™ Cloning 
Technology Manual, GIBCOBRL®, Life Technologies®) are shown. 

In c) and d), recombination sites (lox site in this case), which 
recombine with each other by the Cre-lox recombination mechanism are 
25 present in CS. 

In e) and f) it is shown that the Gateway-like sites flanking a RS and 
the recombination sites like the lox sites (shown in c) and d)) can be 
present at the same time. 

11 
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In (g), recombination sites flanking RS are two lox sites, which do not 
recombine with each other. They work in the same way as the Gateway 
sites do. 

In (h), it is shown the presence into RS of the gene ccdB as 
5 b ackground-re duction. 

In (i), it is shown the presence of a "third" lox recombination site as 
background-reducing sequence, capable of recombination with the lox site 
sequences in CS. 

Figure 6. Mechanism of action of a cloning vector comprising two 
10 homing endonuclease asymmetric recognition site sequences (a). These two 
sequences not capable of ligating with each other, are placed flanking a RS 
during the ligation process. Each of these sequences recognizes and ligates 
to one sequence flanking a nucleic acid insert of interest (b). Only ligation 
vector -insert is allowed. Ligations insert-insert or vector- vector are in this 
15 way avoided. 

Figure 7. It is described an example of preparation of X-FLC-III-E 
The stuffer If, is the stuffer I of Figure If. 

Figure 8. It is disclosed an example of excision of asymmetric 
recognition site sequences, in the specific example using homing 
20 endonuclease I-Ceul and Pl-Scel. 

Figure 9. It is described the preparation of a modified pBAC for the 
preparation of a X,-BAC vector. A detailed explanation of the process is 
disclosed in Example 20. 

Figure 10. It is described the insertion of loxP and Xbal sites into 
25 the modified pBAC of Fig.7. A detailed explanation of the process is 
disclosed in Example 20. 

Figure 11. It is described a chart comprising the steps for the 
preparation of the stuffer II ("component 5")- A detailed explanation of the 
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process is disclosed in Example 20. 

Figure 12, It is described a chart comprising the steps for the 
preparation of the A,-FLC-III-pBAC. A detailed explanation of the process is 
disclosed in Example 20. 
5 Figure 13. It is reported the full nucleotide sequence of an example 
of a general pFLC-II as described in Figure 2h (that is, without showing the 
sequence of the stuffer I or the "insert cDNA"). The "insert cDNA" or stuffer 
I (indicated in Fig.2h with a line of asterisks) is indicated in Fig.13 by a line 
between the sequences CTCGAG GGATCC. This construct of a 

10 general pFLC-II is a modified pBluescriptll SK(+). ■ 

The sequence of the plasmid of Figure 13 is indicated in SEQ ID 
NO:51 as a single sequence starting from the sequence GGATCC (above), 
and terminating with the sequence CTCGAG (above), therefore without 
indicating the sequence of specific stuffer I or cloning cDNA. 

15 Figure 14. This graph compares cloning vector ^-FCL-I-B of the 

present invention and conventional ZAP vector in terms of cloning efficiency. 
DETAILED DESCRIPTION OF THE INVENTION 

Full-length cloning has been hampered by problems related to both 
the preparation and cloning of long cDNAs. A consistent part of the 

20 problems has been overcome with the preparation of long cDNAs with 

thermostabilized and thermoactivated reverse transcriptase (Carninci et al., 
1998, Proc. Natl. Acad. Sci. USA. 95: 520-524) and the development of cap- 
based full-length cDNA selecting techniques (Carninci et al., 1996, Genomics, 
37: 327-336; Carninci et al., 1997, DNARes., 4: 61-66; Carninci et al., 1999, 

25 Methods Enzymol, 303: 19-44; Carninci et al., 2000, Genome Res., 10: 1617- 
1630). 

However, cloning methods and methods for preparing bulk cDNA 
libraries still showed a bias for short size cDNAs. 

13 
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The present inventors provide a new family of vectors capable of 
cloning nucleic acids with wide range size and preferably very long and full- 
length cDNAs, high efficiency of excision and reduced background and 
contamination. Also provided are methods of cloning using such vectors. 
5 According to a first embodiment, the invention provides a cloning 

vector comprising a construction vector segment (CS) and a replaceable 
segment (RS) (also indicated as "stuffer I") (Figure 1). RS is the segment 
that will be replaced by the nucleic acid insert of interest, which one intends 
to clone. 

10 The bacteriophage or plasmid vector of the invention can be both 

linear or circular (Fig.5, a-i). In case of a linear vector, the segment CS can 
be graphically considered as divided into two arms or segments, one at left 
side and the other at right side of RS. However, for more clarity the 
terminology of left arm or segment and right arm or segment of CS will be 

15 also maintained in case of circular vector. 

The vector available in the state of the art was a modified X PS 
vector having a "basic" size of 32 kb plus a 6 kb nucleic acid sequence (stuffer 
II), so that the size of the vector, without considering the cDNA of interest, 
was 38 kb (Japanese patent application having publication number P2000- 

20 325080A filed by the same applicant of the present invention). However, . 
this vector had the disadvantage of bias for short and non full-length cDNAs, 
the presence of which are inconvenient for the preparation of a full-length 
cDNA library or encyclopedia. 

The present inventors have surprisingly found that a vector, 

25 preferably a bacteriophage, more preferably a X bacteriophage, having the 
size of CS of: 36.5 kb ^ CS < 38 kb, preferably CS is 37.5 kb, allowed the 
selection of long and full-length cDNA avoiding the problem of the A, phage of 
38 kb. 
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The preferred size of 37.5 kb of CS according to the vector of the 
present invention is 0.2 kb shorter than the minimum size necessary for a X- 
phage to undergoing packaging, which corresponds to 37.7 kb (Zabarovski et 
al., 1993, as above). 

5 The advantages of the vector of CS 37.5 kb according to the invention 

compared to that of the state of the art of CS 38 kb is showed in Tablel. 

The system for avoiding the bias for short and for the preferable 
preparation of full-length cDNAs can also be applied for bacteriophages 
different from X. 

10 Accordingly, the invention also relates to a cloning bacteriophage 

vector comprising a construction segment (CS) and a replaceable segment 
(RS), wherein the size of CS is: : X-1.2 kb <; CS < X; X (expressed in kb) 
corresponding to the minimum size necessary to the bacteriophage vector for 
undergoing packaging (which nominally is 37.7 kb for X, as reported in 

15 Zabarowski et aL, as above). The size of CS is preferably: X-0.2 kb. 

The diminution of a short fragment from the size of X renders the CS 
fragment below the packaging level, however, the presence of the RS (also 
indicated as "stuffer I") makes the bacteriophage vector capable of packaging. 
In Figures 1 and 2, the vector according to the invention is 

20 constructed inserting a stuffer II of the desired size. Preferably, of 5.5 kb, 
so that the CS corresponds to a size of 37.5 kb. However, the stuffer II can 
be: 4.5 <; stuffer II < 6. The stuffer II can be of any origin and any nucleic 
acid. It can be a foreign sequence fragment, for example a mouse genomic 
DNA or can be taken from plasmid. The stuffer II can also be already 

25 originally present in the vector. 

The CS of the vector according to the invention can preferably be a 
bacteriophage segment, or comprise a bacteriophage fragment. Preferably, 
the bacteriophage is a A, bacteriophage. A list of available bacteriophage 
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and X bacteriophage has been reported in the state of the art of the present 
application (see for example those reported in Sambrook et aL, 2.16-2.53) or 
derivatives thereof. 

CS can also be modified by comprising a plasmid segment at least 
5 comprising a ori. The plasmid comprising ori is preferably selected from 
the group of: pBluescript (+), pUC, pBR322, and pBAC. In Figure 1, for 
example, a fragment of a modified pBluescript(+) comprising ori has been 
inserted into the left arm of CS. An example of use of pBAC or derivative 
thereof for the preparation of vectors according to the invention is given, for 

10 example in Figure 9-12 and Example 20. However, pBAC or its derivative 
can be efficiently used for the preparation of any vector contruct according to 
the invention. Examples of vectors and linker, adapter, primer sequences 
and the like that can be used in the construction of the vectors according to 
the invention are reported in the NCBI VecScreen, UNIVEC Build #3.2 

15 Database (National Centre for Biotechnology Information, National Library 
of Medicine, National Institute of Health, US). Specific information about 
these vectors can also be found in the Catalog of Amersham Pharmacia 
Biotech, Inc., US; Clontech Laboratories, Inc, US; Invitrogen Corporation, 
US; Life Technologies, Inc., US; New England Biolabs, Inc., US; Promega 

20 Corporation, US; and Stratagene, US. 

The cloning vector according to the invention can also comprise a 
selectable marker. Accordingly, CS comprises at least a selectable marker 
selected from the group consisting of: a DNA segment that encodes a product 
that provides resistance against otherwise toxic compounds (e.g. antibiotic 

25 resistant gene); a DNA segment that encodes a product that suppresses the 
activity of a gene product; a DNA segment that encodes a product that is 
identifiable (e.g. phenotypic markers such as beta-galactosidase, green 
fluorescent protein (GFP), and cell surface proteins); a DNA segment that 
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encodes a product that inhibits a cell function; a DNA segment that provides 
for the isolation of a desired molecule (e.g. specific protein binding sites); and 
a DNA segment that encodes a specific nucleotide recognition sequence 
which is recognized by an enzyme. 

The selectable marker is more specifically at least a marker selected 
from the group consisting of an antibiotic resistance gene, an auxotrophic 
marker, a toxic gene, a phenotypic marker, an antisense oligonucleotide; an 
enzyme cleavage site, a protein binding site; and a sequence complementary 
to a PGR primer sequence. 

Amp as an example of selectable marker is showed in Figures 1 and 

2. 

The RS of the vectors of the invention can be flanked by two 
recombination sites (as showed in Figures 1, 5) wherein these two 
recombination sites do not recombine with each other. More in particular, 
these recombination sites are selected from the group consisting of attB, attP, 
attL, and attR or their derivatives for carrying out the recombination 
excision according to the Gateway™ methodology (Walhout et al., 2000, as 
above; Life Technologies catalogue; Gateway Cloning Technologies, 
Instruction Manual, GibcoBRL, Life Technologies; and US 5,888,732). The 
complete list of Gateway recombination sites and derivatives is disclosed in 
the above Life Technologies references. 

The Gateway™ system has been proposed in the art for exchange of 
components between plasmids and for transferring a nucleic acid insert of 
interest into a specific functional plasmid. However, the Gateway system 
showed a bias for short cDNA; long cDNAs are obtained with low efficiency 
(Michael A. Brasch, slide "Gateway cloning of attB-PCR products", 
GIBCOBRL® Technical Seminar, "Gateway Cloning Technology", Life 
Technologies™, 1999). 

17 
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The present inventors have instead surprisingly found that when 
Gateway recombination sites are transferred into a bacteriophage vector 
according to the present invention and positioned flanking the RS (as shown 
in Figures 1, 2 and 5,a, b, e, f) the cloned cDNA library did not show bias for 
short cDNAs. 

The present invention therefore, provides a bacteriophage vector, 
preferably having a CS size of: 32 kb <; CS < 45 kb, in particular 36.5 kb <; CS 
< 38 kb, more preferably CS is 37.5 kb comprising two recombination sites, 
which do not recombine with each other, flanking RS (Fig.5,a-g). The 
bacteriophage is preferably a X bacteriophage. 

The bacteriophage vector according to the present invention, 
however, is not limited to X bacteriophage but other bacteriophage known in 
he art can be used (for example those described in Zabarovski et al., 1993, as 
above). 

In the vector according to the present invention, in alternative to the 
Gateway attB, P, L or R or their derivatives, two lox recombination sites 
flanking RS (for example, two generic loxl and lox2 sites are shown in 
Figure 5, g) can be used. These lox recombination sites can be any mutated 
or derived lox sites, for example a mutated or derived loxP site (for example 
loxP511) as described in Hoess et al., Nucleic Acids Res., 1986, 14(5):2287. 

The vector according to the invention can also comprise two lox 
recombinant sites each of them placed in each arm (or segment portion) of 
CS (Figures 1, 2, and 5,c-f,i), that is, one lox site placed in the CS, at the left 
side of the RS (or of the nucleic acid of interest) and the other lox site in the 
CS, at the right side of the RS (or of the nucleic acid insert of interest); these 
lox recombination sites being capable to recombine with each other. 

These sites can be two lox recombination sites modified, mutated or 
derived lox site (Hoess et al., 1986, as above), preferably a loxP or a 

18 
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modification or derivative thereof. For example, the lox sites can be loxP 
511 (Hoess et al, 1986, as above). AloxP 511 recombines with another loxP 
511 site, but not with a loxP site. All the above variation, mutation, 
modification or derivation of lox site, will be generally indicate as "lox site 
5 and derivative thereof, for the purpose of the present application. 

In this case, after the RS is substituted by the nucleic acid insert of 
interest, the recombination is carried out by a Cre-lox recombinase. 

The Cre-lox recombination system is described in several prior art 
references, for example, Palazzuolo et al., 1990, as above; Elledge et al., 1991, 
10 as above; and Summers et al., 1984, as above. 

In alternative, to the Cre-lox recombinase system, other 
recombination systems can be used for the purpose of the present invention. 
Among them, Kw recombinase (Ringrose L., et al., 1997, FEBS, Eur. J. 
Biochem., 248:903-912), hybrid site-specific recombination system with 
15 elements from Tn3 res/resolvase (Kilbride E., et al., 1999, J. MoL Biol., 
289:1219-1230), )3 recombinase system (Canosa I., et al., 1998, Journal 
Biological Chemistry, Vol.273, No.22, May 29:13886-13891); FLP 
recombinase system (Huffman K.E., and Levene S.D., 1999, J. MoL 
Biol, :286:1-13; and Waite L.L., and Cox M.M., 1995, Journal Biological 
20 Chemistry, Vol.270, N.40:23409-23414). Modification, mutation or derivative 
of these recombination sites can also be used and they will be generally 
indicated as "derivative thereof'. 

The result of this recombination process, mediated by Cre- 
recombinase or other recombinases, is the excision of a plasmid comprising 
25 the nucleic acid of interest. 

According to an embodiment of the invention, the presence of both 
the recombination sites flanking RS for the recombination Gateway-like 
system and the recombination sites in the two arms of CS for Cre-lox, Kw, 

19 
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Tn3 res/resolvase, ]3 recombinase, and FLP recombination, into a vector, 
renders said vector particularly suitable for cloning, transfer of nucleic acid 
material of interest, and preparation of libraries. In fact, according to the 
particular case, the most convenient excision system can be chosen without 
5 changing or modifying the vector. 

According to a further aspect, the cloning vector according to the 
invention can also be used for cloning or for preparing libraries with low or 
no background. Accordingly, the present invention provides a cloning vector 
comprising a construction segment (CS) and a replaceable segment (RS), 
10 wherein said CS is a bacteriophage vector segment and said RS comprises at 
least the ccdB gene as background-reducing system. 

The bacteriophage or plasmid cloning vector according to the 
invention, can also comprises a construction segment (CS) and a replaceable 
segment (RS), wherein said CS is a bacteriophage or a plasmid vector 
15 segment and i) said RS comprises at least a recombination site (capable of 
recombination with the two recombination sites present in the left and right 
arms of CS) as background-reducing system, or ii) RS is flanked by two 
endonuclease asymmetric recognition site sequences which do not ligate with 
each other and are recognized by restriction endonuclease s. 
20 The recombination site comprised into RS must be able to recombine 

with the recombination sites present into the left and right arms of CS, 
therefore, we can address to this RS recombination site as the "third" 
recombination site. 

The "third" recombination site can be a lox recombination site or a 
25 derivative thereof, preferably a loxP site or derivative thereof. 

The two endonucleases asymmetric site sequence background- 
reducing systems can be for example: i) homing endonuclease asymmetric 
recognition site sequences, or ii) asymmetric restriction endonuclease 

20 
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cleavage site sequences recognizable by class IIS restriction enzymes. 

The background-reducing bacteriophage vector has preferably the 
size of CS : 32 kb <; CS ^ 45 kb, advantageously CS is: 36.5 kb <; CS < 38 kb, 
more preferably CS is 37.5 kb. The bacteriophage is preferably a X, 
5 bacteriophage. 

The bacteriophage CS or the vector can comprise a plasmid segment 
at least comprising an ori. The plasmid segment comprising an ori is 
preferably, but not limited to, selected from the group consisting 
of :pBluescript(+), pUC, pBR322 and pBAC, or any plasmid as included into 

10 the NCBI Database, as above. 

In case of the background-reducing plasmid, this can be any kind of 
plasmid known in the art, for example any of the plasmid above indicated or 
disclosed in the NCBI Database. 

This vector preferably comprises at least a selectable marker selected 

15 from the group as above disclosed. In particular, the at least selectable 

marker can be selected from the group consisting of an antibiotic resistance 
gene, an auxotrophic marker, a toxic gene, a phenotypic marker, an enzyme 
cleavage site, a protein binding site; and a sequence complementary to a 
PGR primer sequence. 

20 The background-reducing cloning bacteriophage or plasmid vector 

can also comprise at least one of the recombination system as above 
described, that is i) two recombination sites which do not recombine with 
each other flanking RS (Gateway sites or lox modified sites) and/or ii) at 
least two recombination sites which recombine with each other placed into 

25 the two arms of CS, recognized by a recombinase. These recombination 
sites capable of recombining with each other, are preferably selected from 
the group consisting of : lox sites, Kw, Tn3 res/resolvase, ]3 recombinase 
sites, and FLP sites, as described above. 
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With reference to the background-reducing ccdB system, it has been 
disclosed into plamids by Bernard P. and Couturier M. (1992, J. Mol. Biol., 
226:735-746) and also Walhout et al. (as above) for the Gateway™ vectors. 

The product of the ccdB gene interferes with DNA gyrase. After 
5 recombination, only the plasmids that have lost the ccdB gene (and which 
are recombinant) can grow in E.coli strains not mutated for gyrA, therefore 
providing a selective advantage (see Life Technologies references). 

Plasmids carrying the gene ccdB can propagate only in specific E.coli 
strains. For example in DB3.1, which carries a mutation in gyrA gene 
10 conferring resistance to ccdB (Walhout et al., as above). Therefore, this 

kind of recombination is limited to plasmids, because bacteriophage vectors, 
for instance X substitution vectors, used in cloning systems cannot grow and 
replicate in cells like DB3.1, which lack the recA protein (the recA product is 
required for the growth of substitution-type bacteriophage A,:Sambrook et al., 
15 1989). 

The present inventors have instead surprisingly found that a 
bacteriophage, preferably a X bacteriophage, comprising at least a ccdB gene 
into the RS, according to the invention can propagate and multiply on a 
culture of C600cells. On the contrary, plasmids comprising the ccdB gene 
20 cannot propagate in C600 cells. 

The mechanism of the background-reducing ccdB system in the 
vector of the invention is shown in Figure l,g. 

During the replacement of the RS with the nucleic acid insert of 
interest, it may happen that no replacement occurs or an imperfect ligation 
25 or replacement is realized. In this case, bacteriophage or plasmid vectors 
without complete nucleic acid insert of interest are present in the culture 
creating background. With the presence of ccdB, the "suicide gene", the 
background or byproduct can be reduced about or very closed to zero. 
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A problem of background contamination can also occur during the 
purification, when the removal of stuffer I (RS) is realized on gel (for 
example agarose gel) and fragment of stuffer I nucleic acid is collected with 
CS and can therefore be reinserted into the vectors. 
5 Another background-reducing system is the "third" recombination 

site, which is placed into RS and is capable to recombine with the 
recombination sites present into the left and right arms of CS of the 
bacteriophage or plasmid vector of the invention (Fig.l,i; Fig.5,i). This 
"third" recombination site can be in presence or in absence of the ccdB gene. 
10 Preferably, this background-reducing "third" recombination site is a 

lox site or a derivative thereof, more preferable a loxP site or a derivative, 
modification or mutation thereof, as above described. However, the 
background recombination site present into RS, must be capable of 
recombination with the two recombination sites present in the two arms of 
15 CS. Therefore, in case of recombination mediated by Cre-recombinase, all 
the three sites have to be lox-recombination or derivatives thereof, capable of 
recombining with each other. 

For example, in Figure l,a and l,f, the two recombination sites 
present in the left and right arms of CS (of a bacteriophage or a plasmid 
20 vector) and the background-reducing "third" recombination site into RS 
(stuffer I) are all loxP sites. 

In Figure l.i), it is explained the mechanism of action of the "third" 
recombination site. In case of imperfect ligation of the nucleic acid insert of 
interest, one of the loxP site in arms of CS preferably recombine with the 
25 "third" loxP forming, during the excision step, an excised plasmid, which in 
one case lack the ori and cannot replicate, and in the other case lack the 
selectable marker (Amp in the Figure) and cannot grow up. 

Accordingly, the present invention also relates to a method for 
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cloning or preparing bulk library with low or no background using a 
bacteriophage or plasmid vector comprising at least the "third" 
recombination site as described. 

The background-reducing "third" recombination site can be any 
5 recombination site other than lox, for example the recombination sites used 
for the recombination as above described. 

The background-reducing bacteriophage or plasmid cloning vector 
according to the invention, can also comprises the lacZ gene into RS even in 
presence of the ccdB gene or the "third" recombination site or the like, or in 
10 presence . 

The bacteriophage or plasmid cloning vector according to the 
invention, in alternative or in presence of the background-reducing 
sequences above described, can also comprise two asymmetric sites 
recognized by restriction endonucleases. These two asymmetric site 
15 sequences flank the RS of the vector (Figure 6). 

Asymmetric site sequences useful for the purpose of the present 
invention are: i) two homing endonuclease asymmetric recognition site 
sequences or ii) restriction endonuclease asymmetric cleavage sites 
sequences recognizable by class IIS restriction enzymes. 
20 Homing endonucleases are sold and described by New England 

Biolabs, Inc. A; a description of the asymmetric site sequences is also 
available in the New England Biolabs Catalog. These homing endonuclease 
asymmetric recognition site sequences are from 18 to 39 bp. However, in 
the present invention the recognition site sequences are not limited to those 
25 sequences nor to these sizes. The New England Biolabs Catalog reports 
that after 5-fold overdigestion with I-Ceu-I, greater than 95% of the DNA 
fragments can be ligated and recut with this enzyme. 

Preferably, the restriction homing endonucleases capable of cutting 
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the asymmetric site sequences are selected from the group consisting of: I- 
Ceul, Pl-Scel, PI-PspI and I-Scel. 

Figure 6, a) shows a vector being removed of its RS, bringing two 
homing endonoclease recognition site sequences, which do not ligate with 
5 each other, at the extremities of the CS arms; the RS being removed by using 
the homing endonucleases specific for those site sequences. In Fig.6,b) a 
nucleic acid insert of interest having a pair of homing endonuclease site 
sequences placed flanking said insert of interest (these sequences being the 
same of those of the vector) is provided for the ligation to a vector having RS 

10 removed. In Fig.6,c) one homing endonuclease site sequence of the vector 
recognizes and hybridizes to a complementary homing endonuclease site 
sequence of the insert. In Fig.6,d), the second homing endonuclease site 
sequence of the vector, after a certain time, preferably overnight, recognizes 
and hybridizes the complementary homing endonuclease site sequence 

15 placed on the other extremity of the insert of interest. In conclusion, using 
this system, after a certain time, all the complementary site sequences of the 
inserts recognizes and hybridize with their complementary site sequences of 
the vectors. As consequence, insert-vector ligation is carried out. Both 
insert-insert and vector- vector ligations are not realized since they 

20 extremities are not complementary reducing by-products. With this system, 
also nucleic acid contamination entering the vector is reduced. 

The homing endonuclease recognition site sequences can also be 
placed into a destination vector, preferably a plasmid, and the subcloning 
process can be advantageously carried out. This vector ligates with the 

25 nucleic acid insert of interest, which brings two endonuclease recognition 

site sequences, which are the same of the destination vector, placed flanking 
this nucleic acid insert of interest. 

The same process can be realized when asymmetric site sequences 
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recognized by class IIS endonuclease enzymes are used instead of the 
homing endonuclease site sequences. Examples of class IIS restriction 
enzymes include, AIwl, AIwXI, AJw2GI, Bbsl, Bbvl, Bbvll, Bes£L, Bed, Bcgl, 
BcNI, Binl, Bmrl, Bpml, Bsal, BseRl, BsgL, BsmAI, BsmBI, BspMI, BsrDI, 
5 BstF5l, Earl, Eco31I, Ecobll, Esp3l, Faul, FoM, Gsul, Hgal, HinQVll, Hphl, 
Ksp632I, Mboll, Mmel, MnK, NgoYlll, Plel, RlaAl, Sapl, SfaNl, Taqll, 
TthlYLU, BsriLs, Bsrls, B&mFI, BseMII, and the like (see Szybalski W., et al., 
1991, Gene, 100, 13-26; and Catalog of New England Biolabs, Inc.). 

Examples of recognition sites and cleavage sites of several restriction 
10 enzymes are (into parenthesis are the recognition site and the cleavage site): 
Bbvl (GCAGC 8/12), Hgal (GACGC 5/10), BsmFI (GGGAC 10/14) SfaNI 
(GCATC 5/9), and Bsp I (ACCTGC 4/8). 

The endonuclease asymmetric recognition site sequences as 
described above can be placed into the bacteriophage or plasmid cloning 
15 vector according to the invention also in presence of, the ccdB gene, the lacZ 
gene, and/or the "third" background-reducing recombination site (for 
example lox) into RS. 

The vector ligated with the endonuclease asymmetric system as 
described above can then be excised by any of the recombination system 
20 present in CS, as above described, for example ere -lox recombinase, 

preferably loxP, Kw, FLP, Tn3 res/resolvase, j3 recombinase, etc. The 
vector comprising the endonuclease asymmetric according to the invention, 
therefore, also comprises at least a pair of recombination sites into the CS. 

The RS (or stuffer I) of the cloning vector according to the invention 
25 is removed by the vector and it is replaced by the nucleic acid insert of 
interest with the ligation process. 

The nucleic acid insert of interest which is used in all of the 
embodiments of the present application is selected from the group consisting 
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of DNA, cDNA, RNA/DNA hybrid. Advantageously, long cDNA and 
preferably full-length cDNA. The full-length cDNA is preferably a 
normalized and/or subtracted full-length cDNA. 

Any of the vectors according to the invention has proven to be 
5 particularly useful for cloning nucleic acids of interest and for the 

preparation of library, in particular full-length cDNA library/libraries. 

Accordingly, the present invention relates to a method for cloning at least 
a nucleic acid insert of interest or for preparing at least a bulk nucleic acid 
library of interest, comprising the steps of: 
10 a) preparing at least a cloning vector according to the invention; 

b) replacing RS with a nucleic acid insert of interest into the cloning 
vector obtaining a vector comprising the nucleic acid insert of interest; 

c) allowing the in vivo or in vitro excision of the nucleic acid insert of 
interest or of the plasmid comprising the nucleic acid insert of 

15 interest; 

d) recovering the (recombinant) plasmid carrying the nucleic acid insert 
of interest or the library of (recombinant) plasmids carrying the 
nucleic acid inserts of interest. 

Optionally, between step b) and c), a step of amplification of cloning 
20 vector can be carried out. 

The method according to the invention can also be used for cloning 
nucleic acid insert of interest or for preparing a bulk nucleic acid library of 
interest with reduced or no background. 

Accordingly, the present invention provides a method for cloning a 
25 nucleic acid insert of interest or for preparing a bulk nucleic acid library of 
interest, with low or no background, comprising the steps of: 

(a) preparing at least a cloning vector according to the invention 
comprising a background-reducing system as above described; 
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(b) replacing RS of vector of step (a) with a nucleic acid insert of 
interest; 

(c) allowing the in vivo or in vitro excision of the nucleic acid insert of 
interest or of the plasmid comprising the nucleic acid insert of 

5 interest; 

(d) recovering the (recombinant) plasmid carrying the nucleic acid 
insert of interest and lacking of the background-reducing 
sequence or a library of said plasmids. 

Optionally, an amplification step is carried out between the steps b) 

10 and c). 

The background-reducing system according to the invention can be 
the gene ccdB or a "third" recombination site sequence (capable of 
recombination with the two lox recombination sites present into the left and 
right arm of CS), which is placed into the RS of the bacteriophage or plasmid 
15 vector according to the invention. The "third" recombination site is 

preferable a lox site or derivatives thereof, more preferably a loxP site or 
derivatives thereof 

In case of a Gateway-like method, the gene ccdB is instead placed 
into the RS of a destination vector. 
20 The bacteriophage or plasmid vector or the destination vector can 

also comprise the lacZ gene. 

In Alternative, in the background-reducing method according to the 
invention, the bacteriophage or plasmid vector can comprise two 
endonuclease asymmetric recognition site sequences flanking RS. 
25 Accordingly, the present invention also relates to a method for 

cloning a nucleic acid insert of interest or for preparing a bulk nucleic acid 
library of interest, comprising the steps of: 

(a) preparing at least a bacteriophage or plasmid vector comprising 
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two endonuclease asymmetric recognition site sequences placed 
flanking RS of said vector; 

(b) replacing RS with a nucleic acid insert of interest comprising two 
endonuclease asymmetric recognition site sequences flanking said 

5 insert of interest, said sequences being capable of ligating with 

the two sequences placed into the vector of step a), and obtaining 
a vector comprising the nucleic acid insert of interest; 

(c) allowing the in vivo or in vitro excision of the nucleic acid insert(s) 
of interest or of at least a plasmid comprising the nucleic acid 

10 insert of interest; 

(d) recovering the (recombinant) excised plasmid or destination 
plasmid carrying the nucleic acid of interest or a library of said 
plasmid(s) with low or no background. 

Further, the present invention relates to in vivo and in vitro Cre-lox 
15 recombination system, using the vector according to the invention. 

As discussed in the state of the art section, the Cre-recombinase 
solid-phase in vivo excision (see also Fig. 3 of the present application) known 
in the art (Palazzolo et al., 1990, Gene, 88:25-36) shows drawbacks as low 
plasmid yield (Palazzolo et al., 1990, as above) and plasmid instability; in 
20 fact Cre-recombinase is constitutively expressed causing formation of 

plasmid dimmers/multimers leading to high proportion of plasmid-free cells, 
impairing the sequencing efficiency (Summers et al., 1984, Cell, 36:1097- 
1103). 

A Cre-recombinase liquid-phase in vivo excision, however, has not 
25 been successfully used in the state of the art because in liquid culture, cells 
comprising short plasmids replicate faster than cells comprising very long 
plasmids creating a bias for short plasmids (that is short nucleic acid insert 
of interest), and serious difficulty in obtaining long or full-length nucleic acid 
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inserts. 

The present inventors have surprisingly found that the drawbacks of 
the state of the art could be avoided essentially by allowing an excision of 
plasmids in liquid-phase under condition of very low or no growth 
5 (replication) and amplification, extraction of nucleic acid inserts of interest, 
preparation of different plasmids capable to growth in cells do not expressing 
Cre-recombinase, and further growth (amplification) in solid phase (on plate). 

Accordingly, the present invention provides a method for cloning at 
least a nucleic acid insert of interest or preparing at least a bulk nucleic 
10 acids library of interest comprising the steps of: 

a) preparing at least a cloning vector, comprising a construction 
segment (CS) and a replaceable segment (RS), wherein said CS is 
a bacteriophage vector comprising at least two lox recombination 
sites or derivatives thereof positioned in the left and right arm of 
15 CS.; 

a) replacing RS with a nucleic acid insert of interest into the cloning 
vector; 

b) packaging of the vector; 

c) in vivo in liquid-phase infection of at least a cell expressing cre- 
20 recombinase; 

d) allowing the in vivo in liquid-phase excision of a plasmid 
comprising the nucleic acid insert of interest under condition of 
short-time growth or no growth of the excised plasmid; 

e) carrying out the cellular lysis and recovering the plasmid carrying 
25 out the insert or of a library of these plasmids. 

This method, optionally comprises the steps of: 

f) electroporating or transforming at least a cell, not expressing Cre- 
recombinase, making the plasmid(s) of step f) penetrating into said 
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cell(s); 

g) plating of cell(s) infected as at step g) and recovering the plasmid 
carrying the nucleic acid insert of interest or a library of said 
plasmids. 

5 The electroporation is carried out according to the well-known 

mwthodology in the art. The transformation is preferally carried out by 
chemical treatment, for example, according to Sambrook et al., 1.71-1.84. 

The bacteriophage vector according to this method is preferable a X 
bacteriophage. 

10 The lox recombination sites, which recombine with each other, can be 

any mutated, modified or derived lox site as above described, preferable a 
loxP, which can be mutated, modified or derived (therefore, generally 
indicated as loxP or derivatives thereof). 

The step e) of this method is preferably carried out in 0-3 hours at a 
15 temperature of 20-4°C. The temperature is preferably from room 
temperature to 37°C. 

The present inventors have also developed a new and inventive in 
vitro Cre-lox recombination method. 

In this in vitro method, a bacteriophage vector comprising the nucleic 
20 acid insert of interest is packaged in vitro in presence of (bacterial) 

packaging extract as known in the state of the art (for example, Gigapack® 
or Gigapack Gold® or the like, Stratagene, US). The nucleases present in 
the extract cut the short nucleic acids which have not been packaged and the 
nucleic acid contamination in general. The result is that the nucleic acid of 
25 the vector which has been packaged result purified. 

In a preferred case, when a vector comprising the stuffer II of 5.5 kb 
(or a bacteriophage vector having the size of CS of 37.5 kb) is used, the short 
and not full-length cDNA having sizes below 0.5 kb are not packaged and are 
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removed by the esonuclease. The result is a library with low or without 
bias for short cDNA. This library results to be very useful for the 
preparation of very long and full-length cDNAs. 

Accordingly the present invention provides a method for cloning at 
5 least a nucleic acid insert of interest or at least a bulk nucleic acid library of 
interest comprising the step of: 

(a) preparing at least a cloning vector, comprising a construction 
segment (CS) and a replaceable segment (RS), wherein said CS is 
a bacteriophage vector segment comprising two lox recombination 
sites or derivatives thereof positioned in the left and right arm of 
CS; 

(b) replacing RS with a nucleic acid insert of interest into the at least 
a cloning vector; 

(c) in vitro packaging of the bacteriophage cloning vector of step b) in 
presence of packaging extract; 

(d) extraction of bacteriophage cloning vector(s) from the capside; 

(e) in vitro excision of the plasmid(s) comprising the nucleic acid 
insert(s) of interest from the vector in presence of Cre- 
recombinase; 

(f) recovery of said plasmid or library of plasmids. 
This method may further comprise the steps of: 

(g) electroporating or transforming at lest a cell, not expressing Cre- 
recombinase, making said plasmid(s) entering into said cell(s); 

(h) plating the cell(s) of step g) and recovering plasmid carrying the 
nucleic acid insert of interest or a library of said plasmids. 

Optionally, between the steps c) and d) an amplification step on plate 
of the bacteriophage can be carried out. 

The lox recombination sites can be lox sites mutated, modified or 
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derivative thereof, preferably loxP or derivatives thereof. 

The bacteriophage used in this in vitro Cre-lox method is preferably 
a >, bacteriophage. 

Further, the present inventors have developed a method based on the 
5 Gateway mechanism from transferring nucleic acid insert of interest from 
the vector according to the invention into at least a destination functional 
vector. This functional vector can be utilized for different uses, for example 
for sequencing, for expressing a protein in bacteria or eukaryotic cells, 
making a protein fusion product, and so on. 

10 The Gateway method as already said above is related only to 

plasmids and shows a strong bias for short cDNAs. In the Gateway method, 
cDNAs are amplified by PCR and inserted into the plasmid destination 
vector. However, the reaction times of PCR or full-length cDNAs are very 
long and generally the reaction is carried out overnight, which means low 

15 efficiency and size bias. Fragments with short insert recombine faster than 
fragment with long inserts. Therefore, when mixed, there is always size 
bias, the shortest competes with longer and the short is more efficiently 
cloned causing size bias. 

The present inventors have solved this bias problem of the Gateway 

20 method. 

The method according to the present invention comprises a step of 
ligating nucleic acids of interest (of different size) into the bacteriophage 
vector. 

The bacteriophage vector according to the invention has bigger size 
25 (for example 37.5 kb plus the nucleic acid insert) than the donor vector of the 
Gateway method. A vector having the CS size according to the invention 
does not discriminate between short and long insert and vectors comprising 
both kid of inserts can be amplified and/or excised with a similar efficiency, 
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so that there is no bias for short nucleic acid inserts. 

Accordingly, the present invention provides a "Gateway-like" method 
for cloning at least a nucleic acid insert of interest or for preparing at least a 
bulk nucleic acid library of interest, comprising the steps of: 
5 (a) preparing at least a cloning vector comprising a construction 

segment (CS) and a replaceable segment (RS), wherein said CS is 
a bacteriophage vector segment and RS is flanked by two 
recombination sites, wherein these recombinant sites do not 
recombine with each other; 
10 (b) replacing said RS with a nucleic acid insert according to the 

invention; 

(c) in vitro packaging the at least one bacteriophage cloning vector of 
step b); 

(d) allowing the in vitro excision of the nucleic acid insert of interest 
15 by providing to the cloning vector of step c) at least a destination 

vector comprising a destination replaceable segment (RS) flanked 
by two recombination sites, which are capable of recombining 
with the recombination site of cloning vector (s) of step (a); 

(e) recovering a recombinant plasmid carrying the nucleic acid insert 
20 of interest or a library of said plamids. 

Preferably, the bacteriophage is a X bacteriophage. 

The two recombination sites which do not recombine with each other 
flanking the RS of the bacteriophage cloning vector or of the destination 
vector, can be i) recombination sites selected from the group consisting of 
25 attB, attP, attL, and attR or derivatives thereof, or ii) lox recombination site 
or derivatives thereof, preferably loxP or derivative thereof (for example loxP 
and loxP511). 

After the nucleic acid of interest has been transferred into the 
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destination vector using the Gateway technology, said acid nucleic of interest 
can be transferred in a further destination or receiving vector according to 
the following procedures named as: i) GW direct; ii) GW indirect; and iii) GW 
amplification method, according to Fig.3 and to the examples. 
5 The excised plasmid or destination plasmid bringing the nucleic acid 

insert of interest according to the invention can be used as driver in a 
normalization and/or subtraction method. 

A method for normalization and/or subtraction of a cDNA library, 
preferably a full-length cDNA library, has been disclosed by Carninci et aL, 
10 2000, Genome ites.,10:1617 -1630. 

Accordingly the present invention relates to a method for preparing at 
least a normalized and/or subtracted library comprising the steps of: 

(a) providing at least a plasmid excised or a destination plasmid 
prepared according to the method of the present invention; 
15 (b) providing the plasmid of step b) to a pool of nucleic acid targets; 

(c) removing the plasmid/target hybrids; 

(d) collecting the normalized and/or subtracted nucleic acid targets, 
which did not hybridize to the plasmid of the invention. 

According to an embodiment, the plasmid of step a) is rendered as single 
20 strand. For example, it is treated by making at least a nick into one strand 
of the double stranded plasmid. Then, the strand which has been nicked is 
removed, finally steps (c)-(d) are applied. 

Preferably, the nick is introduced by using the protein Genell (Gene- 
trapper Kit, Gibco, Life Technologies, US) and the strand which has been 
25 nicked is removed by an exonuclease. The exonuclease is preferably ExoIII. 

According to a further embodiment, the present invention relates to 
a method for preparing at least a normalized and/or subtracted library 
comprising the steps of: 
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(a) providing at least a vector according to the invention comprises a 
construction segment (CS) and a replaceable segment (RS), 
wherein CS comprises a Fl ori; 

(b) replacing RS with a nucleic acid insert of interest according to the 
5 invention; 

(c) adding an helper phage and producing a number of a single 
strand DNA (ssDNA) vector copies, secreted from the cells; 

(d) providing the copies of step c) to a pool of nucleic acids targets; 

(e) removing the plasmid/target hybrids; 

10 (f) collected the normalized and/or subtracted nucleic acid targets, 

which did not hybridize with the tar get (s). 
Helper phage is preferably obtainable from Stratagene. A more 
detailed description of a method for preparing ssDNA vector, consisting in 
infecting the bacterial cells with a helper phage (Stratagene catalog), then 

15 recovering the single strand plasmid secreted from the cell, extracting the 
DNA, and finally recovering the DNA from single strand plasmid can be 
found in the Stratagene User Manual of pBluescript. A method using the 
helper phage for reducing the vector at single strand is also described in 
(Bonaldo et al, 1996, Genome Res., 6:791-806). 

20 When using the fl(+) origin of replication, an helper phages such as 

R408 can be used (Short et aL, 1988, as above). 

The bacteriophage vectors according to the invention can be prepared 
using any kind of plasmid or plasmid fragment known in the art, for 
instance pBluescript(+), pUC, pBR322, bacterial artificial chromosome 

25 plasmid (pBAC), pBeloBACll (Kim et aL, 1996, Genomics, 34:213-218, a 
modified or derivative pBeloBACll according to US 5,874,259 (herein 
incorporated by reference), or any other plasmid as listed public database or 
available from Company' s Catalogues as above indicated. 
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Acording to one embodiment, the invention provides a bacteriophage 
vector comprising a bacterial artificial chromosome (pBAC) or pBAC 
derivative or a segment thereof comprising at least an origin of replication 
(ori). The bacteriophage is preferably a X bacteriophage. The ori can 
5 preferably be an ori capable of maintaining the plasmid at single copy. 

The pBAC or segment thereof, comprised into the bacteriophage, 
may further comprise: 

- a site into which an DNA fragment can be cloned; 

- at least one pair of inducible excision- mediating sites flanking the site into 
10 which the DNA fragment can be cloned, the excision-mediating sites being 

provided in parallel orientation relative to one another and defining an 
excisable fragment that comprises the site into which the DNA fragment can 
be cloned. The pair of inducible excision-mediating sites can be, for 
example, sites provided in parallel orientation relative to one another (see 
15 US 5,874,259). The pair of excision-mediating sites are preferably FRT 
sites. The bacteriophage may further comprises into pair of excision- 
mediating sites a sequence as shown in SEQ ID NO:45 (according to US 
5,874,259). 

The pBAC or segment thereof, comprised into the bacteriophage, 
20 may further comprise an inducible origin of replication, preferably oriV. 
Thus oriV may be induced to produce multiple copies of the BAG plasmid 
(the pBAC is usually present at single copy). 

This bacteriophage can comprise one or more of the recombination 
sites described in the present application. For example, this bacteriophage 
25 may comprise at least two recombination sites selected from the following: 
(a) two recombination sites, wherein either site does not recombine with the 
other; (b) two lox recombination sites, wherein either site is capable of 
recombining with each other; (c) two homing endonuclease asymmetric 
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recognition site sequences; (d) two restriction asymmetric endonuclease 
cleavage site sequences, wherein either site sequence does ligate with the 
other, recognizable by class IIS restriction enzymes. 

The two recombination sites (a) may be selected from the group 
5 consisting of attB, attP, attL, attR and derivatives thereof. 

The two recombination sites (a) may also be lox recombination sites 
derivative, which do not recombine with each other. 

The two recombination sites (b) are preferably loxP sites. 

The two homing endonuclease site sequences (c) are preferably 
10 selected from the group consisting of: I-Ceul, Pl-Scel, PI-PspI, and I-Scel. 

The excision used can be any excision system, included those 
described in Figure 3. 

The bacteriophage may further comprise at least a background- 
reducing sequence, for example: a) the ccdB gene; b) the lacZ gene; c) a lox 
15 sequence. 

It is also provided a method for cloning a nucleic acid of interest or 
for preparing a bulk nucleic acid library of interest comprising the steps of: 
(a) preparing a bacteriophage cloning vector comprising a pBAC (or a 
pBAC derivative) or a fragment thereof: 
20 (b) inserting a nucleic acid of interest into the bacteriophage cloning 

vector; 

(c) allowing the in vivo or in vitro excision of the plasmid (pBAC or 
derivative thereof) comprising the nucleic acid insert of interest; 
and 

25 (d) recovering the BAG plasmid carrying the nucleic acid insert of 

interest or a library of these BAG plasmids. 
The present invention also relates to a kit comprising at least a 
cloning vector or at least a library of vectors according to the invention. 
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The present invention will be further explained more in detail with 
reference to the following examples. 
Examples 
Bacterial strains 

5 The following not limitative list of bacterial strains were used in the 

following examples : C600, F thi-1 thr-1 leuBQ JacYl tonA21 supE44-X~; XL1- 
Blue-MRA(P2), A(zncrA)183 A (mciCB-hscBMR-mrr)l73 endAl $upE44 
thi-1 gyrA96 relAl lac (PZlysogen); DB3.1, F" gyrA462 endA A(srl-recA) 
mcrB mrrhdsS20(v B? m B ") supE44 ara-14 gaM2 lacYl proA2 rp$L2Q xyJ-5 A " 

10 leu mtR; BNN132, e!4'(McrA) A (lac-proAB) thi-1 gyrA96 endAl hsdRll 
relAl supE44 [F traD36 proAB ladZ AMIS] constitutively expressing Cre- 
recombinase (Elledge et aL, 1991, Proc. Natl. Sd. USA, 88:1731-1735); and 
DH10B, F" mcrA Aimrr-hsdRMS-mcrBC) 080 ladZ AM15 AlacX74 dedR 
recAl endAl araD139 (ara-leii)lQ91 galU galKX ' rpsL nupG (these 

15 bacterial strains are all commercially available). 
Structure and nomenclature of X.-FLC vectors 

The basic name of the constructed vectors used in the present 
description derives from full-length &DNA; the roman numerals indicate: I, 
general use; II, presence of Gateway sequence (Life Technology); and III, 

20 presence of homing endonuclease sites. L and S indicate whether the 

cloning capacity of the vector better accommodates long (size -selected) or 
short cDNAs. B, C, D, E, and F indicate the type of stuffer I, as described in 
Figures lb— £ 

Ba sic components of )l-FLC vectors 
25 We constructed a series of Abased cloning vectors for broad-size 

directional cloning of full-length cDNAs. These A.-FLC vectors can 
nominally package inserts of approximately 0.2 to 15.4 kb. 

Another benefit of our A,-FLC vectors is that they accommodate 
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cloning and bulk-excision of short and long cDNAs at similar efficiencies 
within the same library. Then, we adapted these vectors for additional 
purposes, for example, for selecting very long or full-length cDNAs by using 
the stuffier II of 5.5 kb (that is a complete size of the construction segment 
5 CSof 37.5 kb). 

The components used to construct the vectors were assembled to 
produce several constructs shown in Figures 1 and 2. 

Figure la illustrates the general scheme for the assembly of the X- 
FLC vectors and excision into a plasmid library by using Cr<9-recombinase or 
10 Gateway recombination system. 

The basic structure of the X-based vectors according to the present 
invention, consists of the left and right X-arms, which are functionally the 
same as those of A,-2001 (Karn et al., 1984, Gene, 32:217-224). Between the 
left and right arms, we inserted a stuffer (stuffer I) and a modified 
15 pBluescript or pBAC, flanked on both sides, by two loxV sites for the bulk 
excision of the plasmid cDNA library, analogous to the structure of A.-PS 
(Nehls et al., 1994a, as above). 

An example of pBluescript construct is shown in Fig. 13 and SEQ ID 

NO:51. 

20 The calculated size of the A, arms plus the plasmid, but excluding 

stuffer I (which is substituted with the cDNA in a library) and stuffer II, is 
about 32 kb. Stuffer II is the "cloning size regulator" and determines the 
size of the insert, given that the nominal lambda packaging capacity 
(Zabarovsky et al., 1993, Gene, 127:1-14). When stuffer II is 5.5 kb long, as 

25 in several constructs presented here, the size of the vector, excluding stuffer 
I, (that is the size of the construction segment CS) is calculated to be 37.5 kb. 
As reported in Tablel, the vector having a stuffer II of 5.5 kb (CS size of 37.5 
kb) is particularly useful in selecting long and full-length cDNAs compared 
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to the use of the same vector having a stuffier II of 6 kb (CS size of 38 kb). 

Alternative stuffer II elements of 0 and 6.3 kb or even more, were 
also used to shift the cloning size and collect wide range size of cDNAs. 

Type I stuffers (Figs. Id- f) can contain the background indicator 
5 LacZ and a background-reducing element, such as the ccdB toxic element or 
an additional loxV site, which separates the antibiotic resistance gene and 
the origin of replication during excision (Fig. li). 

All of the excised plasmids contain conventional forward (Fwd) and 
reverse (Rev) primer sequences and T7/T3 RNA polymerase promoters, to 
10 allow transcriptional sequencing (Sasaki et al., 1998, Proa Natl Acad. Sci. 
USA, 95:3455-3460) and transcription (Figs. 2g-j, underlined sequences). 

In addition, all plasmids can be used to produce single-stranded DNA 
(ssDNA), and all of them carry the fl(+) origin (Short et al., 1988, as above). 
When using the fl(+) origin of replication with helper phages such as R408 
15 (Short et al., 1988, as above) to rescue ssDNA, the strand that is rescued is 
the opposite of the strand represented in Figs. 2g-j. 

In some constructs, we have also introduced cloning or recombination 
sites such as Gateway sequences flanking RS or the cDNA of interest or 
placing site sequences for homing endonucleases (New England Biolabs, Inc. 
20 also indicated as NEB) for bulk or individual excision of the cloned insert. 
Example 1: Construction of vectors 

Any vector according to the invention was generated by following 
standard molecular biology techniques (Sambrook et aL, 1989) and using the 
components shown in Figures. The k arms (that is the portions at left and 
25 right side of Stuffer I) in vectors according to the invention were derived 

from *,-PS (Nehls et al., 1994a, as above) and were originally described for X- 
2001 (Karn et aL, 1984, Gene, 32:217-224). Into the Xbal site in the left 
arm of A,-PS, we inserted a 5.5-kb genomic fragment obtained by PCR 
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amplification of mouse genomic DNA that was cleaved with Xbal and to 
which was ligated a linker/primer adapter containing an AscL restriction site 
for later removal or modification of the insert: the linker/primer upper 
oligonucleotide is : 5" -CTAGGCGCGCCGAGAGATCTAGAGAGAGAG (SEQ 
5 ID NO: 9); the lower oligonucleotide is: 

5' -CTCTCTCTCTAGATCTCTCGGCGC-3' (SEQ ID NO:10). The upper is 
also used for PGR amplification. 

Before PGR amplification, the genomic DNA also was cleaved with 
Xhol, SaR, and SfiL to eliminate these sites from the amplified fragment. 

10 The amplification and agarose gel-purification steps (Boom et aL, 1990, J. 
Clin. Microbiol, 28:495-503) were repeated 3 times. The 5.5-kb fragment 
size was chosen as the size regulator (stuffer II) for the X-FLC-I-B vector, 
and its derivatives were created by cloning similarly obtained fragments of 
approximately 4.5 to 5.5 kb and we verified that inserts as short as 0.5 kb 

15 were clonable. In addition, the sequences of the polylinkers (sequences as 
appears in the excised plasmids of Figure 2) and stuffer I (Fig.l) were 
changed to accommodate directional cloning (according to Standard 
molecular biology techniques, for example Sambrook et aL), basically, 
restriction digestion, followed by re-ligation (T4 DNAligase) with linker 

20 having the desired sequences which are inserted between the previous 

fragments of the phage. The 10-kb stuffer I (Fig. lb) was obtained from A,- 
PS (Nehls etal y 1994a, as above). The 3-kb shorter fragment of the stuffer 
(Fig.lc) was obtained by digesting the 10-kb stuffer I with Xhol and SaR. 
Subsequently, we amplified this 3-kb with the primers 5 ! -GAGAGACTC- 

25 GAGGTCGACGAGAGAGGCCCGGGCGGCCGCGATCGCGGCCGGCCA- 

GTCTTTAATTAACT-3' (SEQ ID NO: 11) and 5'-GAGAGAGGATCCGAGAGA- 
GGCCAGAGAGGCCATTTAAATGCCCGGGCTGCAGGAATTCGATAT-3 ! 
(SEQ ID NO: 12) to add several restriction sites to the 3-kb stuffer (Fig. lc). 
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To this modified stuffer (Fig. lc), we inserted the blunt-ended LadL cassette 
into the Swal site. Then, we restricted the modified stuffer with SfH and 
inserted the ccdB gene as a triple ligation to obtain the stuffer I in Figure le. 
The ccdB gene was obtained by PCR amplification of the template pDEST-C, 
5 which can be propagated in & coli DB3.1 (Life Technologies); the primer 
pairs were S'-GAGAGAGCGGCCGCCCGGGCCATTTAAATCCGGCTTACT- 
AAAAGCCAGA-3' (SEQ ID NO: 13) and the reverse primer 5'- 
AGCGGATAACAATTTCACACAGGA-3' (SEQ ID NO:14)(as in pBluescript, 
Stratagene), and 5 ! -GAGAGAGGCCTCTCTGGCCACTAGTCTGCAGAC- 
TGG CTGTGTATA- 3 1 (SEQ ID NO:15) and the forward primer 5' - 
TGTAAAACGACGGCCAGT-3' (SEQ ID NO:16). The LadL cassette was 
obtained by digesting a pUC18 with Nael and AMU and then blunting the 
appropriate fragment by using the Klenow fragment of DNA polymerase 
before cloning. 

Lox¥, at£B, and the modified polylinker sequences were prepared by 
annealing complementary oligonucleotides. 

The stuffer I of Figure le, after blunting the SaR and BamBI 
restriction sites, was dimerized by ligation with DNA ligase (New England 
Biolabs) to obtain the stuffer in Figure Id. The stuffer in Figure If was 
obtained by PCR amplifying the stuffer in Figure lc with a primer 
containing the LoxP site, 5 ! -GAGAGAGGATCCAGAGAGATAACTTCGTAT- 
AATGTATGCTATACGAAGTTATGAGAGAGGCCAGAGAGGCCATTTAA-3' 
(SEQ ID NO: 17)(on the BamHI side), and the primer 5 ? -GAGAGACTCGAG- 
GTCGACGAGAGAGGCCCGGGCGGCCGCGAT- 

CGCGGCCGGCCAGTCTTTAATTAACT-3 1 (SEQ ID NO: 18) (on the SaR side). 
After purification (according to Boom et al., 1990, as above) and restriction 
digestion, this fragment was ligated with DNA ligase (according to Sambrook 
et al., 1989) to the ccdB fragment to yield the stuffer in Figure If. 
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The plasmids obtained after excision (described later) are derivatives 
of pBluescript* (Stratagene) or pBAC. The pDEST-C vector (Life 
Technologies) is the acceptor plasmid of the LxR reaction (Gateway System, 
Life Technologies) and, after excision, produces pFLC-DEST (Fig.2.j). 
5 pDEST is prepared from pBluescript II SK+ (Stratagene) by removal of the 
polylinker by digesting the pBluescript II SK+ with the restriction enzymes 
SacI and KpnI. Then, blunting the cleaved extremities with T4 DNA 
polymerase (according to Sambrook et al., 1989). The rfB II cassette 
(purchased by Life Technologies) comprising the ccdB gene was then inserted 

10 and ligated into the cleaved plasmid following the instruction of Gateway 
Cloning System Manual, Version 18.4, Life Technologies. The ligated 
plasmid vector was then cleaved with BssHI restriction enzyme and the 
cleaved fragment inverted (that is rotated of 180 degrees) and re-entered 
into the vector (according to known methodologies, Sambrook et al, 1989). 

15 The pDEST-C vector was used in the same way as is pDEST12.2 

(Catalog and Instruction Manual, Gateway™ Cloning Technology, 
GIBCOBRL®, Life Technologies®). 

The X-FLC-I-B vector was in general used as starting point for the 
construction of the other vectors according to the invention. 

20 X-FLC-I-E was obtained by substituting the stuffer in Figure le for 

that of A.-FLC-I-B. X-FLC-I-L-B was obtained by removing stuffer II from X- 
FLC-I-B, and X-FLC-I-L-D was created by substituting the stuffer shown in 
Figure le for that of X-FLC -I-B. X-FLC-II-C was obtained by joining a 
modified pBluescript II KS + (purchased from Stratagene) with a stuffer 

25 like that in Fig. lc; the rest of the vector was as in k-FLC-I-B. k-FLC-III-F 
was created by inserting a construct containing the plasmid sequence and 
stuffer I of Fig. If (the construct is shownFigure 2d) into k-FLC-I-B-derived . 
phage arms (including the 5.5-kb stuffer II) in the same way as described in 
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the example "preparation of A,-FLC-III-C (but introducing the stuffer If 
instead of the stuffer lc). The vector X-FLC-III-F was also prepared as 
shown in Fig.7. X-FLC-III-L-D was obtained from k-FLC~III-F by first 
substituting the stuffer I of Fig. If with the one of Figure Id, followed by 
5 deletion of stuffer II. X-FLC-III-S-F was obtained by ligating (using DNA 
ligase, as described in Sambrook et aL, 1989) the concatenated arms from X- 
FLC-I-B (devoid of stuffer II) with a 6.3 Kb long stuffer II and the 
"plasmid+stuffer T derived from X-FLC-III-F. Vector X-FLC-III-E was 
prepared in the same ways as described for X.-FLC-III-F (and A,-FLC-III-C) 

10 introducing the stuffer le instead of the stuffer lc or If; with "stuffer le" it is 
intended the stuffer I of Fig.le, and the like for the other stuffers). Vectors 
comprising a pBAC or pBAC derivative can be prepared as shown in 
Example 20 and according to Figures 9-12. 
Example 2 : Preparation of Alarms for cloning 

15 The final X,-DNA constructs were prepared by using standard 

methods (Sambrook et al, 1989) or the Lambda Maxi Prep Kit (#12562, 
Qiagen). The cohesive termini (cos ends) of 10 |ug of X-DNA were annealed 
by incubating for 2 h at 42°C in 180 ^1 10 mM Tris -CI (pH 7.5)/10mM MgCl 2 . 
We then added 20 jxL lOx ligation buffer and 400 U T4 ligase (New England 

20 Biolabs) and incubated the mixture for 5 h at room temperature. The ligase 
was inactivated by incubating for 15 min at 65°C. 

At this point, the X-DNA was digested with the required restriction 
enzymes (as described below; all purchased from New England Biolabs) in 
3 steps because of the different concentrations of NaCl needed. For the first 

25 step, restriction was done in 50 mM NaCl by the addition of 2 jxL 5 M NaCl, 
6 U Fsel, and 8 U Pad for each vector. The sample (the vector) was 
incubated for 4 h or overnight at 37°C. The second step was done in 100 
mM NaCl by adding 2 jliL 5 M NaCl, 30 ^iL 10x NEB 3 buffer, 270 |xL H 2 Q, 
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and 20 U Swal to the previous reaction and incubating for 2 h at room 
temperature. After this step, the reaction tube was heated for 15 min at 
65°C. Finally, the third step was done in 150 mM NaCl by adding 5 jllL 5 M 
NaCl, 40 U Xhol (in the cases of the ^,-FLC-I and -III vectors, to reduce the 
5 background by reducing the size of the K coli genomic DNA fragments; and 
for the X-FLC-II vectors, to create the cloning site), 40 U SaR, and 40 U 
BamHI to the heat-inactivated reaction and incubating for 4 h at 37°C. For 
^-FLC-II vectors, the SaR may be omitted or may be used to generate an 
alternative to the Xhol cloning site. The Fsel, Pad and Swal step are 

10 omitted for the X-FLC-I-B, which does not carry these sequences. 

After restriction, the DNA was purified by proteinase K treatment in 
the presence of 0.1% SDS and 20 mM EDTA, extracted with 1:1 
phenol/chloroform and chloroform, and precipitated with ethanol (Sambrook 
et al., 1989). To avoid problems during resuspension, the DNA concentration 

15 did not exceed 20 jxg/mL. 

After careful resuspension for at least 30 min, the digested DNA was 
separated in a 0.6% low-melting point agarose gel (Seaplaque®, FMC) 
according to the followings steps. The wells were in the middle of the gel. 
After electrophoresis for 1.5 h at 8 V/cm, the DNA fragments of the Styl- 

20 digested X,-DNAthat were shorter than 19 kb were cut from the gel and 

discarded (step 1). Then, the electrophoresis buffer lx TBE (electrophoresis 
buffer Tris-Borate-EDTA ; see Sambrook et al., 1989) was replaced with fresh 
buffer, and the DNA remaining in the gel was electrophoresed in the 
opposite direction at 8 V/cm for 2.5 h. Then the DNA shorter than 19 kb 

25 again was discarded (step 2). The buffer was changed again. To condense the 
region containing the A,- arm DNA to decrease reaction volumes, the DNA 
remaining in the gel was electrophoresed at 8 V/cm for 30 min in the same 
direction as for step 1. Finally, the portion of the gel containing the X-arm 
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DNA was removed (step 3), the gel was equilibrated with TE buffer 
(Sambrook et al., 1989), and the A,- arms were purified and checked as 
described (Carninci and Hayashizaki, 1999, Methods Enzymology, 303:19- 
44) by using /3 -agarase (New England Biolabs). We typically recovered 30% 
5 to 50% of the starting X-DNA. The purified Alarms were stored indefinitely 
in single-use aliquots at -80°C or at +4°C for up to 1 week. A typical cloning 
efficiency was 1—2 x 10 7 pfu/jig X-FLC-I-B vector with a test insert of 6 kb 
and less than 1% background of non-recombinant clones. 
Example 3 : Preparation of X,-FLC-I-B 
10 A -PS vector has been cleaved using BamHI restriction enzymes and 

stuffer I inserted using a left linker adapter comprising two complementary 
oligonucleotides: upper oligonucleotide 

5' -GATCAGGCCAAATCGGCCGAGCTCGAATTCG-3' (SEQ ID NO: 19) and 
lower oligonucleotide 5 ? -TCGAGAATTCGAGCTCGGCCATTTGGCCT-3' 
15 (SEQ ID NO:20), and a right linker adapter comprising two complementary 
oligonucleotides: upper oligonucleotide 

5 ? -GATCAGGCCCTTATGGCCGGATCCACTAGTGCGGCCGCA-3' (SEQ ID 
NO:21) and lower oligonucleotide 

5 ? -TCGATGCGGCCGCCTAGTGGATCCGGCCATAAGGGCCT-3 , (SEQ ID 
20 NO:22). 

Each one of two oligonucleotides of the left adapter, that is SEQ ID 
NO: 19 and SEQ ID NO:20 was treated with Kinase with cold ATP for 20 min 
at 37°C as follows: 1 \xg of each oligonucleotide, 1 fxl of ATP 5mM, 2 [xl of PNK 
buffer (New England Biolabs), 0.5 ^1 of PNK (Polynucleotide Kinase; New 
25 England Biolabs), and water up to 20 [xl. The obtained products were the two 
complementary oligonucleotides 5 ? -phosphorilated. The two oligo (SEQ ID 
NOS:19 and 20) solutions were mixed together and NaCl added to a final 
concentration of 100 mM. The mixer was incubated 15 min at 65 ° C and 
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then for 10 min at 45°C to carry out the annealing. The annealed oligos were 
diluted at the concentration 0.5 ng/[xl suitable for cloning. The same 
procedure was carried out for the oligo pair (SEQ ID NOS: 21 and 22) which 
were also annealed forming the right adapter. 

200 ng of X-PS vector above cleaved with BamHI (that is the left and 
the right arms) were mixed with 0.4 ng of the left adapter and 0.4 ng of the 
right adapter, and 60 ng of the stuffer I, in a final volume of 5 The 
ligation was carried out overnight (alternatively the ligation can also be 
carried out for 2 hours and 16°C). The ligated vector/adapters/stuffer I was 
packaged according to the methodologies known in the art Sambrook et al., 
1989). 

A stuffer II of 5.5-kb genomic fragment obtained by PCR 
amplification of mouse genomic DNA that was cleaved with Xbal was ligated 
at both extremities with a linker/primer adapter containing an AscI 
restriction site for later removal or modification of the insert. The 
linker/primer upper oligonucleotide is : 5"- 

CTAGGCGCGCCGAGAGATCTAGAGAGAGAG (SEQ ID NO:9); the lower 
oligonucleotide is: 

5' -CTCTCTCTCTAGATCTCTCGGCGC-3' (SEQ ID NO:10). 

The stuffer II with the adapter was introduced into the Xbal site in 
the left arm of X vector above prepared, obtaining the vector A,-FCL-I-B. 

From this vector after the excision with in vitro Cre-lox recombinase 
(as described later), the plasmid pFLC-I-b (the plasmid of Fig.2g comprising 
the stuffer I of Fig. lb) was obtained. 
Example 4 : Preparation of )JFLC-III-C 

Plasmid pFLC-I-b ? obtained from excision of A,-FLC-I~B as described 
above, was used as template and amplified by PCR. The primers used were: 
T7 Rev (56 mer) 
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5'-GTGTGATATCGCCCTATAGTGAGTCGTATTACATAGCTGTTTCCTGTGT 
GAAATTG-3' (SEQ ID NO:23) and T3 Fwd (70 mer) 

5'-GAGAGATATCTTTGTTCCCTTTAGTGAGGGTTAATTGCGCGCAATTCA 
CTGGCCGTCGTTTTACAACGTC-3' (SEQ ID NO:24) obtaining the linear 
5 "product 1". 

Plasmid pFLC-IIc was used as a template and amplified by PCR. The 
primers used were: FLCIIX2 (68 mer) 

5'-GAGAGACTCGAGGTCGACGAGAGAGGCCCGGGCGGCCGCGATCGCG 
GCCGGCCAGTCTTTAATTAACT-3' (SEQ ID NO:25) and primer FLCIIB2 
10 (63 mer) 

5'-GAGAGAGGATCCGAGAGAGGCCAGAGAGGCCATTTAAATGCCCGGGC 
TGCAGGAATTCGATAT-3' (SEQ ID NO:26). The product of this PCR was 
cleaved with Xhol and BamHI restriction enzyme obtaining a linear 
fragment of 3 bk. This fragment was used as template for PCR amplification 
15 with the primers: 5' I-Ceul-Sall (59 mer) 

5'-GTGTAACTATAACGGTCCTAAGGTAGCGAGTCGACGAGAGAGGCCCG 
GGCGGCCGCGAT-3' (SEQ ID NO:27) and 3' PI-Scel-BamHI (67 mer) 5'- 
GCATCTATGTCGGGTGCGGAGAAAGAGGTAATGAAATGGCAGGATCCGA 
GAGAGGCCAGAGAGGCCA-3' (SEQ ID NO:28), obtaining the hnear 
20 "product 2". 

The "product 2" was then phosphorilated with PNK-polynucleotide 
kinase and gamma-ATP according to Sambrook et al., 1989. 

Then, the "product 1" was cleaved with the EcoRV restriction enzyme 
and the fragment obtained was ligated (according to the standard 
25 methodology, Sambrook e al., 1989) with the "product 2" prepared as above. 
A (circular) plasmid indicated as "product 3" was obtained. 

The plasmid "product 3" was used as template and amplified by PCR 
using the primers: Xbal-LoxP Tag primer 3F (69 mer) 
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5 5 -GAGAGTCTAGATAACTTCGTATAGCATACATTATACGAAGTTATAAATC 
AATCTAAAGTATATATGAGT-3' (SEQ ID NO:29) and Xbal-LoxP Tag primer 
3R (69 mer) 

5 ? -GAGAGTCTAGATAACTTCGTATAATGTATGCTATACGAAGTTATAAAAC 
5 TTCATTTTTAATTTAAAAGG -3' (SEQ ID NO:30) obtaining a linear product, 
which was then cleaved with Xbal restriction enzyme, obtaining the linear 
"product 4". 

A A,-FLC-I-B was cleaved with Xbal restriction enzyme, then purified 
with electrophoresis according to the standard methodology (Sambrook, et 

10 al., 1989) and the resulting X left arm, X right arm, and stuffer II were 

recovered from the purification by electrophoresis. 200 ng of X left arm, 90 ng 
of X right arm, 55 ng of Stuffer II, and 60 ng of the "product 4" were ligated 
overnight according to the standard methodology (Sambrook et al., 1989). 
The obtained vector X-FLC-III-C was packaged according to the 

15 methodologies known in the art (Sambrook et al., 1989). 

By treatment with Cre-recombinase, the in vitro cre-lox recombinase 
excision was carried out and the plasmid pFLC-III-c (plasmid of fig.2i 
comprising the stuffer I of Fig.lc)) obtained. 

Other X-FLC vectors can be prepared starting from X-FLC-III-C 

20 vector. For example, vector X.-FLC-III-F or A,-FLC~III-E can be prepared by 
substituting the stuffer Ic of ^-FLC-III-C with the stuffer If or Ie, 
respectively. 

Example 5 ; Pre pa ration of A,-FLC-H - C 

pBluescript II SK+ (purchased from Stratagene) was digested with 
25 Kpn I and Not I. The large fragment was separated by agarose gel 
electrophoresis and purified. 

X-FLC-I-B was digested with Xhol and Sail and blunted by T4 DNA 
polymerase, according to standard methodology (Sambrook et al., 1989). A 3 
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kb fragment was separated by agarose gel and purified. 

Then three double stranded linkers (AttBl, AttB2 and LoxP) were 
synthesized as follows. 
AttBl linker: upper oligonucleotide is 
5 & -CGGGCCACAAGTTTGTACAAAAAAGCAGGCTCTCGAGGTCGACGAGA 
GGCCAGAGAGGCCGGCCGAGATTAATTAA-3' (SEQ ID NO:31), lower 
oligonucleotide is 

5'-TTAATTAATCTCGGCCGGCCTCTCTGGCCTCTCGTCGACCTCGAGAGC 
CTGCTTTTTTGTACAAACTTGTGGCCCGGTAC-3' (SEQ ID NO:32). 
10 AttB2 linker: upper oligonucleotide is 

5'-GGCCATGACGGCCGAGAGATTTAAATGAGAGAGGATCCACCCAGCTT 
TCTTGTACAAAGTGGTCTAGACCTCTCTTGG-3 ? (SEQ ID NO:33), lower 
oligonucleotide is 

5'-GAGGTCTAGACCACTTTGTACAAGAAAGCTGGGTGGATCCTCTCTCAT 
15 TTAAATCTCTCGGCCGTCATGGCC-3' (SEQ ID NO:34). 
LoxP linker: upper oligonucleotide is 

S'-CCGCATAACTTCGTATAGCATACATTATACGAAGTTATGC-S 5 (SEQ ID 
NO:35), lower oligonucleotide is 

S^GGCCGCATAACTTCGTATAATGTATGCTATACGAAGTTATGCGGCC 
20 GA-3' (SEQ ID NO:36). 

The lower strand of attB2 linker and the upper strand of LoxP linker 
were phospohorylated by using polynucleotide kinase PNK; New England 
Biolabs) according to how described above in the preparation of X-FLC-I-B. 

The two oligos (SEQ ID NO:31 and 32) solutions were mixed together 
25 and NaCl added to a final concentration of 100 mM. The mixer was 

incubated 15 min at 65°C and then for 10 min at 45°C to carry out the 
annealing. The annealed oligos were diluted at the concentration 0.5 ng/jxl 
suitable for cloning. The same procedure was carried out for the oligo pairs 
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(SEQ ID NO: 33 and 34; and for SEQ ID NO:35 and 36) which were annealed 
respectively. AttB2 linker (0.5 ng ) and LoxP linker (0.5 ng) were mixed and 
ligated in the volume of 5 The tube was incubated at 16 ° C. After 20 min, 
attBl linker (0.5 ng ), pBluescript cleaved with KpriL and JVotl (25 ng) and 
5 the 3 kb fragment from A,-FLC-I-B (25 ng) were added in the tube in the 
volume of 10 |llL Then, it was incubated overnight at 16°C obtaining a 
ligation solution comprising a plasmid comprising the ligated fragment. The 
ligation solution comprising a plasmid was then introduced by 
electrophoresis into DH10B cells and plated on a medium. Plasmids was 
10 prepared from the recombinant cells. The cells were lysed and the plasmids 
cleaved with Xbal and a plasmid fragment was obtained "fragment 1". 

A junction linker was prepared, having an upper oligonucleotide: 5'- 
GGCCATGAGAT-3' (SEQ ID NO:37), and a lower oligonucleotide is: 5 ? - 
CTAGATCTCAT-3' (SEQ ID NO:38). These two oligonucleotide were 
15 annealed and the "fragment 2" obtained. 

X-FLC-I-B was cut with NotI and a 26 kb fragment was separated 
with agarose gel and purified "fragment 3". 

A 9 kb fragment was also prepared by cleavage with Xbal of X-FLC-I- 
B "fragment 4". 

20 These "fragments 1-4" (26 kb left arm, the junction linker, stuffer- 

plasmid, 9 kb right arm) were ligated in the volume of 5 \d. The ligation 
solution was packaged and amplified obtaining the vector X-FLC-II-C. 
These steps were carried out according to standard procedures (Sambrook et 
al., 1989). 

25 From the vector X-FLC-II-C after in vitro excision with Cre- 

recombinase (see later), the plasmid pFLC-II-c (the plasmid of Fig.2j 
comprising the stuffer I of Fig.lc) was obtained. 
"FbcflTn plft fi : Preparation of ?i-FLC-III-F 
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A X-FLC-III-F vector can be prepared as described at the end of 
Example 4, however, other methods of preparation are also possible. One 
alternative way of preparation of XFLC-III-F, which will be described in the 
present example is represented in Fig.7. 
5 To obtain lambda arms and stuffer II (5.5 kb), the cohesive termini of 

10 \ig of X,-FLC-I-B were annealed by incubating for 2 h at 42°C in 180 jutl 10 
mM Tris -CI (pH 7.5)/10mM MgCl 2 . We then added 20 lOx ligation 
buffer and 400 U T4 DNAligase (New England Biolabs) and incubated the 
mixture for 5 h at room temperature. The ligase was inactivated by 

10 incubating for 15 min at 65°C. The concatemerized A.-FLC-I-B was digested 
with 30 units of Xba I (NEB) in 1 x manufactures recommendation buffer. 
The tube was incubated for 2 h at 37°C. 

After restriction, A,-FLC-I-B/XbaI DNA was purified by proteinase K 
(Qiagen) treatment in the presence of 0.1% SDS and 20 mM EDTA, extracted 

15 with 1:1 phenol/chloroform and chloroform, and precipitated with ethanol 
(Sambrook et al., 1989). To avoid problems during resuspension, the DNA 
concentration did not exceed 20 |Lig/mL. 

After careful resuspension for at least 30 min, the digested DNA was 
separated in a 0.6% low-melting point agarose gel (Seaplaque®, FMC) for 1.5 

20 h at 8 V/cm. The portion of the gel containing the 29 kb A, DNA (ligation 
product between L-arm and R-arm) and 5.5 kb stuffer II were cut out and 
equilibrated with TE buffer (Sambrook et al., 1989). The DNAs were 
purified and checked as described (Carninci and Hayashizaki, 1999, Methods 
Enzymology, 303:19-44) by using j3 -agarase (New England Biolabs). 

25 3 jxg of pBS II SK+ (Stratagene) was digested with 9 unit of Bss HII 

(NEB) at 37°C for 2 h and dephosphorylated by CIP (Takara, Japan) 
(Sambrook et al., 1989, standard technique). 

To introduce homing nuclease sites (I-Ceul and Pl-Scel) into pBS II 
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SK+, double strand, an I-CeuI/PI-Scel adaptor oligonucleotide comprising an 
oligonucleotide up adaptor strand: 

o'-pCGCGCTAACTATAACGGTCCTAAGGTAGCGAGTCGACGAGAGAGAG 
AGGATCCATCTATGTCGGGTGCGGAGAAAGAGGTAATGAAATGGCAG-3' 

5 (SEQ ID NO:39) and an oligonucleotide down adaptor strand: 5'- 

pCGCGCTGCCATTTCATTACCTCTTTCTCCGCACCCGACATAGATGGATC 
CGAGAGAGAGAGTCGACTCGCTACCTTAGGACCGTTATAGTTAG-3') 
(SEQ ID NO:40) was prepared (according to standard technique), and ligated 
with pBS II SK+/BssHII (NEB) /CIP (Takara, Japan). 
10 pBS II SK+/BssHII/CIP and I-CeuI/PI-Scel adaptor were hgated, by 

mixing 100 ng of pBS II SK+/BssHII/CIP, 2 ng of I-CeuI/PI-Scel adaptor, 400 
unit T4 DNAligase, lx ligation buffer in a total volume of 5 jjl. The tube was 
incubated overnight at 16°C 

The ligation products were introduced into DH10B and cultured. The 
15 clones containing the proper plasmid were selected by preparing plasmid 
and restriction using I-Ceul (Sambrook et al., 1989, standard technique). 
Then the I-CeuI/PI-Scel adaptor was substituted with Stuffer If (the stuffer I 
of Fig. If) described as following. 

3 jxg of plasmids comprising I-CeuI/PI-Scel adaptor were digested 
20 with 9 units of Sal I and 9 units of Bam HI in 30 |iL To remove the Sall- 
BamHI short fragment, the plasmid/Sall and BamHI were separated in a 
0.6% low-melting point agarose gel (Seaplaque®, FMC) for 1.5 h at 8 V/cm. 
The 3 kb DNA was cut out and equilibrated with TE buffer (Sambrook et al., 
1989). The 3 kb DNA were purified and checked as described (Carninci and 
25 Hayashizaki, 1999, Methods Enzymology, 303:19-44) by using /3 -agarase 
(New England Biolabs). We typically recovered 30% to 50% of the starting 
DNA. 

100 ng of the plasmid DNA and 140 ng of stuffer If were ligated with 
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400 unit T4 DNA ligase, 0.5 yl of 10 x ligation buffer in a total volume of 5 \d. 
The tube was incubated overnight at 16°C. 

The ligation products were introduced into DH10B and cultured. The 
clones containing the proper plasmid were selected by preparing plasmid 
5 and restriction using BamHI and Sail (Sambrook et al., 1989, standard 
technique). 

In the next step loxP sites were introduced into the vector between 
amp r gene and ori. LoxP was introduced by PCR using Xbal - LoxP Tag 
primer 3F (69 mer) having the sequence: 
10 5' -GAG-AGT-CTA-GAT-AAC-TTC-GTA-TAG-CAT-ACA-TTA-TAC-GAA-GTT- 
ATA- AAT-CAA-TCT-AAA-GTA-TAT-ATG-AGT-3' (SEQID NO:41) and Xbal 
- LoxP Tag primer 3R (69 mer) having the sequence: 

5'-GAG-AGT-CTA-GAT-AAC-TTC-GTA-TAA-TGT-ATG-CTA-TAC-GAA-GTT- 
ATA-AAA-CTT-CAT-TTT-TAA-TTT-AAA-AGG -3' (SEQ ID NO:42) 

15 (according to standard technique). 

Using 3 ^ig of the resulting PCR product (7.2 kb), the PCR product 
was digested with 9 units of Xbal at 37°C for 1 h (Sambrook et al.,). To 
remove short DNA fragment resulting from PCR product/Xbal, the digested 
product was separated in a 0.6% low-melting point agarose gel (Seaplaque®, 

20 FMC) for 1.5 h at 8 V/cm. The 7.2 kb DNA was cut out and equilibrated with 
TE buffer (Sambrook et al., 1989). The 7.2 kb DNA were purified and 
checked as described (Carninci and Hayashizaki, 1999, Methods Enzymology, 
303:19-44) by using # -agarase (New England Biolabs). 

The 7.2 kb PCR product, the purified arms and stuffer II (5.5 k) were 

25 ligated in the ratio of 25 ng: 100 ng: 19 ng with 400 units of T4 DNA ligase 
(Sambrook et al., 1989). 

The ligation solution was packaged and amplified obtaining the 
vector X-FLC-III-F. These steps were carried out according to standard 
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procedures (Sambrook et al., 1989). 
Example 7 : Preparation of X-FLC-III-E 

The X-FLC-III-E vector can be prepared by substituting the stuffer I 
of other FLC-III vectors with the stuffer Ie. 
5 In the present example, ^-FLC-III-E was obtained by substituting 

the stuffer If of the k-FLC-III-F vector prepared in Example 6 with the 
stuffer Ie (i.e. the stuffer I of Fig.le) according to the following steps. 

The cohesive termini of 10 jug of ^-FLC-III-F were annealed by 
incubating for 2 h at 42°C in 180jd 10 ml Tris -CI (pH 7.5)/10mM MgCl 2 . 
10 We then added 20 ^iL lOx ligation buffer and 400 U T4 DNA ligase (New 
England Biolabs) and incubated the mixture for 5 h at room temperature. 
The ligase was inactivated by incubating for 15 min at 65°C. 

At this point, the concatemerized X-FLC-III-F was digested with the 
required restriction enzymes, by adding 30 units of BamHI, 30 units of Sail 
15 and 40 jllI lOx BamHI buffer (all purchased from New England Biolabs) in a 
total volume of 400 [xl. The tube was incubated for 2 h at 37°C. 

After restriction, the DNA was purified by proteinase K (Qiagen) 
treatment in the presence of 0.1% SDS and 20 mM EDTA, extracted with 1:1 
phenol/chloroform and chloroform, and precipitated with ethanol (Sambrook 
20 et al., 1989). To avoid problems during resuspension, the DNA concentration 
did not exceed 20 ^ig/mL. 

After careful resuspension for at least 30 min, the digested DNA was 
separated in a 0.6% low-melting point agarose gel (Seaplaque®, FMC) for 1.5 
h at 8 V/cm. The portion of the gel containing the X DNA was cut out and 
25 equilibrated with TE buffer (Sambrook et al., 1989). The JJ3NA were purified 
and checked as described (Carninci and Hayashizaki, 1999, Methods 
Enzymology, 303:19-44) by using j3 -agarase (New England Biolabs). We 
typically recovered 30% to 50% of the starting A,-DNA. 
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To obtain stuffer Ie (figle), 10 fig of X-FLC-I-E were digested with 30 
units of BamHI, 30 units of Sail in 200 \d lxBamHI buffer. The tube was 
incubated for 2 h at 37°C. 

After restriction, the 5 kb DNA fragment was separated in a 0.6% 
low-melting point agarose gel (Seaplaque®, FMC) for 1.5 h at 8 V/cm. The 5 
kb DNA (stuffer Ie) was cut out and equilibrated with TE buffer (Sambrook 
et al., 1989). The 5 kb DNA were purified and checked as described (Carninci 
and Hayashizaki, 1999, Methods Enzymology, 303:19-44) by using (3 - 
agarase (New England Biolabs). We typically recovered 30% to 50% of the 
starting DNA. 

The X-FLC-III-F having the stuffer If removed, and stuffer Ie 
(prepared as above) were ligated (the ratio was 210 ng to 30 ng) by mixing 
with 400 units T4 DNAligase in 10 ul of lx ligation buffer (NEB). The tube 
was incubated overnight at 16°C. 

The ligation solution was packaged and amplified obtaining the 
vector X-FLC-III-E. These steps were carried out according to standard 
procedures (Sambrook et al., 1989). 
Exam ple 8 ; Preparation of pPEST-C 

pBluescript II SK+ (purchased from Stratagene) was cleaved with 
Sad and Kpnl restriction enzymes followed by blunting with T4 DNA 
polymerase (Sambrook et al., 1989) and two fragments were obtained. The 
short fragment was removed by agarose gel electrophoresis and the long 
fragment purified and recovered. The purified long fragment was ligated 
with RfB cassette overnight at 16°C according to standard methodology 
(Sambrook et al. 1989) and introduced into DH10B cells by electroporation 
(Sambrook et al. 1989). Recombinant clone was amplified and plasmid 
extracted (pDEST-A) In order to invert the BssBII fragment in pDEST-A, 
pDEST-A was cut with BssBII restriction enzyme and then extracted by 
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using phenol/chloroform and precipitated by ethanol (Sambrook et aL, 1989) 
and two fragments were obtained. These two fragments, digestion products 0 
of pDEST-A, were ligated overnight at 16°C by inverting the RfB cassette of 
180 degrees (Sambrook et aL, 1989) and the obtained plasmid introduced 
into DH10B cells by electroporation. The clone having the fragment inverted 
was selected (pDEST-C) by restriction mapping (Sambrook et al. 1989). 
Example 9 : Preparation of pFLC-PEST 

X-FLC-II-C and pDONR201 (Life Technologies) were recombined by 
BP clonase (Life Technologies). Then the recombination vector was mixed 
with pDEST-C and recombined by LR clonase. The reaction solution was 
introduced into DH10B cells by electroporation and the recombinant clone 
selected on LB plate containing ampicillin. Recombinant cells were amplified 
and the plasmid (pFLC-DEST) was prepared. 
Example 10 : Preparation of purified p FLO III -f 

100 ng of X-FLC-IE-F were treated with 1U Cre-recombinase (in vitro 
cre-lox mediated recombinase) at 37°C for 1 hour in 300 \xl, and the FLC-III-f 
plasmid was excised. The plasmid was then extracted with 
phenol/chloroform, and chloroform, and precipitated with ethanol (according 
to Sambrook et aL, 1989). The recovered plasmids were electrop orated into 
DH10B (Life Technologies) at 2.5 kb/cm. The cells were spread on LB agar 
containing ampicillin, X-gal (Sambrook et aL, 1989) and cultured overnight 
at 37°C. Blue colony from LB plate containing ampicillin were picked up and 
plasmids prepared using QIAGEN kit. 

The plasmids were digested with restriction enzymes (I-Ceul, Pl-Sce 
I ) according to the following steps. 

First restriction step: a solution of 20 \il of lOXI-Ceu I buffer, 20 jxl of 
10 X BSA and 3U of I-Ceu I (total volume 200 \xL) was prepared in a tube and 
incubated for 5 hour at 37°C. 
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Second step of restriction: 22.5 [xl of 10XPI-Sce I buffer and 3U PI- 
Sce I were added and the obtained solution incubated for 5 hour at 37°C. 
After this step, the tube was heated for 15 min at 65°C. Then, the digested 
DNA was purified by proteinase K treatment (Sambrook et aL, 1989), 
5 extracted with phenol/chrolofolm, chroloform,and prepicipated with ethanol 
(as described in Sambrook et al., 1989). After careful resuspension, the 
digested DNA was separated in 0.8% low melting agarose gel as follows. 
After electrophoresis for 1.5 hours at 50V, the DNA fragments (2.9 kb) were 
cut off from gel and recovered. They were purified with QIAGEN QIAquick 

10 Gel Extraction kit and then used for the ligation. 
Example 11 : Preparation of cDNA and cloning 

Full-length cDNAs were prepared as described (Carninci and 
Hayashizaki, 1999, as above; Carninci et al., 1997, DNA Res., 4:61-66) and 
normalized and/or subtracted (Carninci et al., 2000, Genome Res., 10:1617- 

15 1630) before cloning. After digestion with 25 U BamHI (New England 

Biolabs)/ jj,g cDNA (to cleave the 3' end) and 25 U Xhol (Fermentas Vilnius, 
Lithuania)/ \ig cDNA (to cleave the 5 f end), the cDNA was treated with 1.3 U 
thermosensitive shrimp alkaline phosphatase (SAP; Amersham Pharmacia 
Biotech)/ jug cDNA to avoid concatenation and chimerism of cDNAs, which 

20 are concerns when working with large-capacity cloning vectors. Then the 

cDNA was treated with proteinase K, extracted with phenol/chloroform, and 
applied to a CL-4B spin column (Amersham Pharmacia Biotech). The 
purified cDNA was ethanol-precipitated (Carninci and Hayashizaki, 1999, as 
above) or size-fractionated. Normalization/subtraction was not used for 

25 cDNA that was size -fractionated by using an agarose gel. This process was 

similar to that used in the isolation of the X arms of the vectors: the direction 
of electrophoresis was inverted after short fragments were run out of the gel 
(we changed the buffer before resuming the electrophoresis). cDNA was 
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isolated from the gel either by using /3 -agarase (New England Biolabs) as 
described or by binding in the presence of 7 M guanidine-Cl to double-acid- 
washed and size -fractionated diatomaceous earth (Sigma) essentially as 
described (Boom et aL, 1990, J.Clin.MicrobioL, 28:495-503). 
5 cDNA and vectors were always ligated(according to Carninci and 

Hayashizaki, 1999, Methods Enzymology, 303:19-44) at an equimolar ratio 
in a 5-^L reaction containing T4 DNAligase (New England Biolabs). The 
quantity of cDNAwas estimated by the radioactivity incorporated during 
synthesis of the first and second strands (Carninci and Hayashizaki, 1999, 

10 as above). The cloning sites on the vectors were the SaR (cohesive ends with 
Xhol) and BaniHI sites, except that Xhol and BamHI sites were used for the 
X-FLC-II-C vector. 

cDNA sequencing was performed as described (Shibata K., et aL, 
2000, Genome Res., 10:1757-1771), and sequence analysis and clustering 

15 were performed as described (Konno et aL, 2001, Genome Res., 11:281-289). 
Example 12 : Bulk excision of cDNA libraries 

I) In vivo, solid-phase excision (state of the art) 

cDNA libraries were amplified in K coli C600 cells. Approximately 1- 
5 x 10 4 pfu were plated on 150-mm dishes of LB-agar, topped with LB-agar 

20 containing 10 mM MgS0 4 , and grown overnight to confluence (Sambrook et 
aL, 1989, as above). Subsequently, phage particles were eluted with SM- 
buffer and titered. Then, BNN132 cells were grown overnight in LB-broth 
plus 10 mM MgS0 4 . Cells were pelleted, resuspended in 10 mM MgS0 4 , and 
immediately infected with the phage library, which was converted in vivo to 

25 a plasmid DNA library and plated on LB-ampicillin plates. 

II) In vivo, liquid-phase excision 

Up to 5 x 10 10 phage particles prepared as above were used to infect 
10 mL of overnight-grown BNN132 cells (OD 600 = -0.5) after pelleting and 
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resuspending in 10 mM MgS0 4> which were then cultured in 90 LB medium 
supplemented with 100 |u,g/ml of ampicillin. After 1, 2 or 3 h at either 30°C or 
37°C, the cultures were stopped, and we extracted the plasmid by using the 
Wizard Plus Midiprep DNA Purification System (Promega). The plasmid 
5 library was electroporated into DH10B cells (Life Technologies) at 2.0 Kv/cm, 
which are suitable for sequencing operations as described (Shibata K., et aL, 
2000, as above). 

III) In vitro Cre-lox-media ted excision 

Phage cDNA libraries were amplified in C600 cells as described. We 
10 isolated the library phage DNA from the amplified phage solution by using 
the Wizard Lambda Preps DNA Purification System (Promega). We 
converted one fourth of the obtained phage DNA to plasmid by treating with 
1 U Cre-recombinase at 37°C for 1 h in 300 jliL as recommended (Novagen), 
and then purified (proteinase K treatment, phenol/chloroform extraction and 
15 ethanol precipitation, according to Sambrook et aL, 1989). The bulk-excised 
plasmid libraries were electroporated into DH10B cells (Life Technologies) at 
2.0 kV/cm. 

IV) Gateway-mediated bulk-excision ('indirect?') protocol 

We mixed 16 ng library phage DNA, 300 ng pDONR201(Instruction 
20 Manual, Gateway Cloning Technology, GibcoBRL, Life Technologies), 4 \iL 
BP buffer, and BP Clonase enzyme mix (Life Technologies) in 20 jxL. 
Overnight incubation at 25°C was followed by proteinase K treatment in the 
presence of 0.2% SDS and 10 mM EDTAat 45°C for 15 min. We added 1 jig 
glycogen and extracted the reaction by using phenol/chloroform and 
25 chloroform; the sample was precipitated by using isopropanoL The 

precipitate was mixed with 300 ng pDEST12.2 (Life Technologies), 4 |LiL LR 
buffer, and 4jliL LR Clonase enzyme mix in a volume of 20 juL. The sample 
was further purified with proteinase K/phenol chloroform extraction followed 
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by ethanol precipitation. 

V) "Amplified indirect" protocol 

The sample was treated as in the previous protocol (Gateway- 
mediated bulk excision-"indireet") until the BP Clonase reaction. We 
5 electroporated 1 |uL of the 20-jaL reaction into DH10B cells. The cells were 
spread on LB containing kanamycin, and the resulting colonies underwent 
plasmid extraction (Sambrook et a!, 1989). The prepared plasmids were 
each reacted with LR Clonase and purified and then electroporated as before. 

VI) "One-tube" ('direct?) protocol 

10 The procedure was the same as that for the indirect protocol until 

the BP Clonase reaction (Life Technologies). Then, we added 450 ng 
pDEST12.2, 6 |iiL LR Clonase enzyme mix, and 1 pL 0.75 M NaCl to the tube 
(total volume, 30 M-L). The sample was treated with LR Clonase and purified 
as described. The BP/LR-reacted samples were dissolved in sterile water and 

15 electroporated into DH10B cells. The transformed cells were spread on LB 
plates containing either ampicillin or kanamycin and cultured overnight at 
37°C. 

To assess the conversion frequency of each excision method, we 
prepared the plasmids from 60 random colonies from LB plates. The 
20 plasmids were cut with Pvull, and the sizes of the inserts were analyzed by 
using 0.8% agarose gels. We also could assess the conversion efficiency by 
counting the colonies that grew on ampicillin- or kanamycin-containing 
plates. 

Exam ple 13 : Homing endonuclease system : a vector for ligation-mediated 
25 transfer of inserts: k-FLC-IH-F 
1) Insert cDNA preparation 

cDNA libraries were prepared by cloning the cDNA (prepared as in 
Carninci et al., 2000, Genome Research, 10:1617-1630) into the >.-FLC-III-F 
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vector (Example 6), which carries the homing endonucleases I-Ceul and PI- 
Scel (New England Biolabs) at either side of the cloning sites (SaE and 
BamHI). These homing endonucleases, which recognize and cleave 
sequences of 26 and 39 bp respectively, do not cleave mouse genome (in fact, 
5 these homing endonucleases statistically cut once every 1.8 x 10 18 base pairs 
and once every 1.2 x 10 24 , respectively and therefore are very unlikely to cut 
even once high complex genomes such as Human and Mouse, whose total 
size is about 3 x 10 9 base pairs). Therefore, they are optimal for subcloning 
cDNAs without internal cleavage of any of the tens of thousand clones in a 
10 library. 

A phage cDNA library was prepared according to one variant of the 
cap-trapper technology (Carninci et al., 2000, Genome Research, 10:1617- 
1630) and cloned into X FLC-III-F and amplified in C600 cells (Sambrook et 
al., 1989). We isolated the library phage DNA from 1 ml of the amplified 

15 phage solution by using the Wizard Lambda Preps DNA Purification System 
(Promega). Purified library phage DNA was digested with restriction 
enzymes (I-Ceul, Pl-Sce I ). First restriction step: a solution of 5[xl of 10 XI- 
Ceu I buffer, 5^1 of 10 X BSA and 2.5U of I- Ceu I (total volume 50^1) was 
prepared in a tube and incubated for 4 hour at 37°C. 

20 After this step, the restriction tube was heated for 15min at 65°C. 

The digested DNA was purified by proteinase K treatment (Sambrook et al., 
1989), extracted with phenol/chloroform, and chloroform, and precipitated 
with isopropanol, and very carefully resuspended. The second step 
restriction was carried out as follows: redissolve the DNA in 40 yl of water, 

25 add 5 \il of lOXPI-Sce I buffer and, 4U Pl-Sce I (New England Biolabs, total 
volume 50 pil),and incubate for 4 h at 37°C. After this step, the restriction 
tube was heated for 15min at 65°C. The digested DNA was purified by 
proteinase K treatment , extracted with phenol/chloroform, and chloroform, 
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and precipitated with isopropanol, and very careful resuspension. (as in 
Sambrook et aL, 1989). 

2) pFLC IH -f preparation 

A.-FLCIE-F vector (Example 6) was excised with in vitro cre-lox 
5 mediated recombinase. At first , lOOng of X-FLCHI-F were treated with 1U 
cre-recombinase at 37°C for 1 hour in 300 [xl final volume. Then, extracted 
with phenol/ chloroform, and chloroform, and precipitated with isopropanol 
(Sambrook et aL, 1989). The plasmids were electroporatetd into E. coli 
DH10B (Life Technologies) at 2.5 kv/cm following the instruction of the 

10 manufacturer. Cells were spread on LB-agar (Sambrook et aL, 1989) 

containing 50 [xg/ml of ampicillin. lb the surface of the agarose in the 9 cm 
petri dish, we added also 40 microliters of 2% X-gal and 7 microliters of 200 
ml IPTG for colorimetric detection of the plasmid carrying the LacZ stuffer 
I to facilitate later identification of the background (for a theoretical 

15 consideration: Sambrook et aL, 1989). The plate was cultured overnight at 
37°C and the day later several dozens colonies appear. We picked one blue 
colony from the above LB, inoculated in 50 ml of LB-broth/50 microgram/ml 
ampicillin and let grow overnight with 300 rpm shaking (Sambrook et aL, 
1989). Next day we prepared plasmid DNAby QIAprep spin mini prep kit 

20 (QIAGEN). 

3) Plasmid vector preparation (removal of the stuffer D (see also Fig.8) 

This step is to prepare a plasmid (in this case pFLC-III-f) devoid of 
the stuffer I (in this case stuffer of Fig. If) to maximize the recombination. 

Three ^g of plasmids cDNA were digested with restriction enzymes 
25 (I-Ceu I , PI- See I ). In the first step restriction was done in total volume 50 
\il in presence of 5 \il of lOXI-Ceu I buffer, (New England Biolabs), 5 \il of 
10 X BSA (bovine serum albumine supplied by New England Biolabs with the 
enzyme) and 4U of I-Ceu I (New England Biolabs, and incubation for 4 hour 
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at 37°C. After this step, the restriction tube was heated for 15min at 65°C. 
Digested DNA was purified by proteinase K treatment, extracted with 
phenol/chloroform, and chloroform, and precipitated with isopropanol, and 
very carefully resuspended (Sambrook et ah, 1989). The second restriction 
5 step was done in a total volume of 50 \il supplemented with. 5 fxl of 10 X PI- 
Sce I buffer (New England Biolabs), 4U Pl-Sce I (New England Biolabs,), 
and incubated for 4 hour at 37°C. After this step, the restriction tube was 
heated for 15min at 65°C. Digested DNA was purified by proteinase K 
treatment, extracted with phenol/chloroform, and chloroform, and 

10 precipitated with isopropanol (Sambrook et al., 1989). After very careful 
resuspension, the digested DNA was separated in 0.8% low melting agarose 
gel (seaplaque agarose FMC) buffered with TAE (TYis-acetate-EDTA; see 
Sambrook et al., 1989). In the following step: after electrophoresis for 1.5h 
at 50V, the DNA fragment corresponding to the empty plasmid vector (2.9kb) 

15 was cut off from gel and purified by QIAGEN QIAquick Gel Extraction kit 
(QIAGEN). 

4) li gation of cleveaged plasmid pFLC-I II-f and cDNA i nsert (see also Fi%.8) 

7.5ng of prepared insert and 100 ng of pFLG H -f plasmid vector, 
prepared in the above step 3), were mixed in a final volume of 100 

20 containing also 10 X T4 DNA ligase buffer (New England Biolabs) and DNA 
200U of T4 ligase (New England Biolabs) and incubated at 16°C overnight. 
Ligated palasmids were electroporated into DH10B at 2.5 Kv(Kilovolt)/cm 
(Invitrogen) following the manufacturer' s instruction. Cell were spread on 
LB containing ampicillin (as above), and cultured overnight at 37°C. We 

25 picked then randomly 12 colonies and prepared plasmids (inoculation in 3 ml 
LB-broth/50 microgram/ml ampicillin and let grow overnight with 300 rpm 
shaking (Sambrook et al., 1989). Plasmid DNA was prepared with a 
Quiagen plasmid DNA extraction kit. 
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The plasmids were cut with PvuE (New England Biolabs) in 
presence of IX PvuJL buffer) and their insert size was analyzed using 0.8% 
TBE agarose gel stained with Ethidiumbromide (Sambrook et al., 1989).. 
5) Result 

5 Titer : pFLCHI-f + insert (cDNA):2.1 X 10 4 pfu/ml 

Insert size check (average size) 
Excision protocol here presented: 3.07kb 

In vitro Cre-lox mediated recombinase (control experiment): 3.1kb. 
The control experiment consisted in the same library excised with the Cre- 
10 lox following protocol as the example 12, (number III, in vitro Cre-lox 
mediated excision). 

It has been known in the art that the use of restriction enzymes give 
high size bias. In fact, usually plasmid libraries prepared by ligation show 
half the size of lambda-excised cDNA libraries (in Table 2 the cerebellum 
15 library is 1.4 Kb in pBluescript while 3.36 Kb with k-FLC-I-B: the size is 
only 41.6%, and therefore not very efficient). 

In the current example, instead, the size with the homing nucleases 
is 3.07 kb versus 3.0 kb, the 99%, which is almost not relevant size bias (a 
1% size bias enters in the statistical variability). In conclusion, we proved 
20 that the excision system using homing endonucleases restriction enzymes is 
an efficient excision system. 

. Example 1 4 : Vectors for size selection and background-reducing system s 
The X-FLC-I-B and other vectors shown in the Figures 1 and 2 has 
25 been used to successfully prepare libraries of full-length mouse cDNA, and 
showed to having a cloning capacity of ~0.2 to 15.4 kb cDNAs. 

When we tried to clone strongly subtracted cap-trapped cDNAs 
(according to the method described in Carninci et aL, 2000, Genome fies., 10: 
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1617-1630), we found that because of the paucity of cDNA (less than 10 ng), 
using X-FLC-I-B led to a certain background. When this background 
exceeded 20% to 30%, it affected the cost-performance of subsequent large- 
scale sequencing operations. To develop a vector associated with less 
5 background, we prepared a new, very effective method to decrease the 
background of A,-phage libraries that are excised into plasmids. We 
substituted the stuffer I in A,-FLC-I-B with that in Figure le to produce the 
k-FLC-I-E. The stuffer of this vector carries 2 copies of the "suicide gene" 
ccdB (Bernard and Couturier, 1992, J. Mol. Biol y 226: 735-745) and a 

10 functional LadL for blue-white selection (Fig. If). Notice that the LadL 

present in the pBluescript-derived fragment is nonfunctional because it is 
disrupted by either stuffer I or the cloned cDNA. Interestingly, K phages 
carrying the ccdB gene can replicate in & coli C600; this suggests that 
during the lytic cycle of the X phage, DNA gyrase, the target of the ccdB gene 

15 product, is dispensable. 

After the excision procedure, we plated the equivalent of up to 300 pg 
of the excised vector (without insert) but did not obtain any colonies. On 
the contrary, in a control experiment, we obtained more than 1175 colonies 
(equivalent to the background) when we plated the equivalent of ~3.5 pg of a 

20 similar construct containing a 3.6-kb insert but without ccdB instead of the 
stuffer. This difference constitutes an impressive background reduction of 
at least 10 5 -fold, similar to that of X-FLC-III-F (described later). 
Example 15 : DNA contamination background 

All of the tested background-reducing stuffers like those in Figures 

25 ld-f yielded undetectable background derived from nonrecombinant vectors 
and therefore can be considered interchangeable. With the vectors X-FLC-I- 
E, X-FLC-III-F, X-FLC-III-D, k-FLC-III-S-F, and >.-FLC-I-L-D, the 
background depend on the environmental DNA contamination. In a test 
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experiment, we did not ligate any cDNAto X-FLC-I-E. Because there was 
no background to reduce at the ^-plating stage, we obtained 8.4 x 10 4 pfu/jxg 
vector, which included the contribution of non recombinant vector, compared 
with typical values of > 10 7 pfu/fxgfor positive controls. We amplified the 
5 background plaques, excised the plasmids, analysed 12 clones, and 

sequenced representative samples showing different electrophoretic patterns. 
The background clones that remained after the selection were derived only 
from the E. coli genome, which was probably a residual from the dead E. coli 
cells during the vector DNA preparation, whereas no vector sequence was 

10 found in any insert. Therefore, if a goal is the complete absence of 

background, all contaminating genomic DNA must be eliminated from the 
XDNA preparations and, perhaps more importantly, cDNAs must have intact 
ends so that they are easily clonable. 
Example 16 : Background-reduction loxP system 

15 The background reduction associated with stuffer I differs from that 

of the stuffer in A.-FLC-I-E, because we independently tested a double 
strategy using a single copy of ccdB and an additional loxP site inserted into 
the stuffer I (Fig. If). During the excision process, the third loxP site 
favours the separation of the origin of replication from bla (the gene for /3 - 

20 lactamase, for conferring resistance to ampicillin), as shown in Figure li. 
To eliminate this problem, we manipulated the order of the plasmid 
sequence and loxV elements in the X- vector so that the loxP on stuffer I was 
between bla and the origin of replication. Neither of the defective excised 
plasmids can replicate or confer antibiotic resistance (Fig. li). 

25 In a preliminary experiment, we constructed a A,-FLC-III-type vector 

that contained as a stuffer only the background-reducing sequence of Figure 
li but without the ccdB gene. We obtained 43 colonies from ~3.5 pg of the 
excised plasmid compared with 771 from ~3.5 pg of a control excised plasmid 
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of the same size that lacks both the loxP background reducing sequence and 
the ccdB gene. Therefore, the lox V background-reducing sequence 
eliminated 94.4% of the background. When ccdB was added to the loxP- 
containing stuffer, the resulting vector did not yield any colonies even when 
5 we electroporated up to 350 pg of excised plasmid, which had a background- 
reducing element like that in Figure If. This result corresponds to a 
background reduction of at least 7.7 x 10 4 -fold, a factor similar to that 
obtained with the background-reducing element of the X-FLC-I-E vector. 
The background-reducing systems of both the >^-FLC-III-F and A,-FLC-I-E 
10 vectors were considered sufficient for our full-length cDNA cloning purpose. 
Exam ple 17 : Bulk excision of cDNA libraries 

Before bulk excision, cDNA libraries are optionally amplified on a 
solid-phase medium according to the standard procedure (Sambrook et 
al.,1989). 

15 This process does not decrease the size of the cDNA library, but 

because of the preferential packaging of long phages, decreases (but does not 
eliminate) the frequency of the phages that carry cDNA inserts of 
approximately <; 0.5 kb. Amplification in C600 cells eliminates 
hemimethylation, which is used to clone the cDNA (Carninci and 

20 Hayashizaki, 1999, as above). Hemimethylated cDNA of a primary cDNA 
library would be cleaved during the in vivo excision in BNN132 (described 
later). 

D Cre-lox-bas ed excision - In vivo solid-phase excision 

The in vivo solid-phase excision process (representing the state of the 
25 art) seems straightforward (Figure 3), simply requiring infection of the 
amplified cDNA library into the BNN132 bacterial strain, which 
constitutively expresses Cre-recombinase (Elledge et al., 1991, Proe. Natl. 
Acad. Sci. USA, 88:1731-5). However, this practice is not recommended, 
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because of plasmid instability (Summers et aL, 1984, as above) and low 
plasmid yield (Palazzolo et aL, 1990, as above). In fact, Gre-recombinase is 
expressed constitutively, causing formation of plasmid dimers and multimers 
and leading to a high proportion of plasmid-free cells (Summers et aL, 1984, 
5 as above), thereby impairing the sequencing efficiency. We confirmed that 
low plasmid yield and plasmid loss after prolonged culture are the rule when 
using BNN132 as a host strain for cDNA libraries. 
II) Cre-lox-based excision - In vivo liquid-phase excision 

The in vivo liquid-phase excision process overcomes this problem of 
plasmid loss and poor yield after prolonged culture: we extracted the excised 
plasmid cDNA library after a brief culture at 30°C or 37°C and electroporate 
into any convenient E. coli strain, such as DH10B. Similar results in terms 
of size of the excised library were obtained after culture/excision for 1, 2, or 3 
h at either 30°C, which is supposed to preserve the size of the library 
unbiased by keeping the plasmid at a low copy number (Lin-Chao et aL, 1992, 
Mol. Microbiol, 6:3385-3393), or 37°C, at which plasmids are expressed at 
increased copy number. The copy number is also inversely proportional to 
the size of the cDNA inserts. When we excised a cDNA library cloned in X,- 
FLC-I-B, the final titer after the excision was 2.4 x 10 8 cfu/[xg after culture 
for 1 h at 30°C, 9.1 x 10 8 cfu/|Ag after 2 h at 30°C, and 1.4 x 10 9 cfu/^ig after 3 
h at 30°C. The titers after growth at 37°C were 1.5 x 10 9 cfu/jxg after 
incubation for 1 h, 9.8 x 10 8 cfu/^g after 2 h, and 2.8 x 10 9 cfu/^ig after 3 h. 
The average insert size was 4.1, 3.9, and 3.3 kb for 1, 2, and 3 h at 30°C, and 
2.9, 3.6, and 3.8 kb for 1, 2, and 3 h at 37°C, respectively. These results 
suggested that there were no noteworthy excision-associated problems 
related to the length of inserts or to the temperature and duration of the 
BNN132 E. coli culture. 

To better quantify the size bias associated with the Cre-lox excision 
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system, we mixed an equal number of non-recombinant A,-FLC-I~B vectors 
carrying the 10-kb stuffer with phages from the amplified cDNA library, 
then infected the cells. The ratio of clones containing the 10-kb insert was 
close 50% at all of the described conditions. This result confirms the 
robustness against size bias of the Cre-lox excision system. Among the 
advantages of this in vivo liquid-phase excision method is the high DNA 
yield, which facilitates downstream operations, such as the production of 
consistent quantities of single-stranded plasmid DNA by using Genell- 
Exdlll, which can be used for further normalization/subtraction of existing 
cDNA libraries (Bonaldo et al., 1996, Genome Res., 6:791-806) while 
avoiding plasmid amplification steps that could decrease the size of the 
amplified library. 

III) Cre - lox-ba sed excisio n ■ In vitr o excision 

Although it does not show size bias, the in vivo liquid-phase excision 
procedure still involves a brief round of library amplification, which might 
cause sequence-specific representational bias. Therefore, we developed the 
in ^dtro excision method, which is based on Cre- mediated recombination. 

This excision system uses purified XX) NA from the amplified cDNA 
library, followed by electroporation. For this application, we tested the 
electroporation conditions described for long BAG inserts (Sheng et al., 1995, 
Nucl. Acids Res., 23:1990-1996). In light of our results from sizing 60 
plasmids after restriction with Pvull , we did not find significant differences 
in the final size of the plasmid cDNA library when we used pulses between 
1.7 and 2.5 kV/cm. We regard the Cre-lox in vitro excision protocol as the 
most suitable of those we tested, because it does not require even a brief 
amplification step of cDNA libraries in BNN132, is robust in terms of size 
bias, and can be used with all of the vectors described here. 

IV) Gateway?™ -system-mediate excision 
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For ^-FLC-II-C, in addition to the Cre-lox excision protocol for 
excising a pFLC-II plasmid (Fig. 2h), we have developed protocols for bulk 
excision which are based on the Gateway system. 

Inserts are at first transferred into an entry vector, the pDONR201 
5 (Life Technologies), followed by transferring to a destination vectors, the 
pDEST12.2 (Life Technologies, structure not shown). 

X.-FLC-II-C vector that we prepared carries the Gateway ati£>\ and 
at£B2 sequences for transferring individual clones (Walhout et al., 2000, as 
above) or bulk libraries into different functional vectors (Fig. 2c) or into 
10 pFLC-DEST (Fig. 2j) for sequencing. 

The three Gateway excision protocols (the "indirect", "amplified 
indirect", and "direct" protocols) are outlined in Figure 3 and described above 
in the experimental part. 

Any of the Gateway -mediated bulk-excision protocols was a valid 
15 alternative to the Cre-loxhvXk excision procedure. In fact, the average size 
of 60 clones from the excised cDNA sublibraries was 2.3 kb for the control 
Cre-lox reaction (in vitro Cre-recombinase protocol), 2.4 kb with the 
"indirect" protocol, 2.5 kb with the "amplified indirect" protocol, and 3.3 kb 
with the "direct" protocol. The average size of this cDNA before excision 
20 was 3.7 Kb. Considering the final size close to the average size of mRNAs 
on gel, we considered the excision systems satisfactory. The Gateway- 
mediated excision system is anyway very attractive when sufficient cDNAis 
available for cloning into X-FLC-II-C, which accommodates the use of the 
Gateway excision protocols. In light of the requirements of our sequencing 
25 operation, we used pFLC-DEST (Fig. 2j) as our destination vector. 

Exam ple 18 : Comparative example between 6.0 kb and 5,5 kb Stuffer I I 
vectors 

1) Vectors construction 
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X-FLC-I with 5.5 Kb stufferll was constructed as described before in 
the examples above. To compare the cloning size, ?v,-FLC-I with 6.0 Kb 
stufferll was constructed. We added a 0.5 Kb fragment in the Hindlll site 
on the 5.5 Kb stufferll. 0.5 Kb fragment was obtained by restriction 
5 digestion with Hindlll of mouse genomic DNA. Mouse genomic DNA was 
digested with Hindlll and 0.5 Kb fragment was separated by gel 
electrophoresis. The fragment was subcloned into the pBluescript + 
(stratagene) and cleaved by Hindlll and inserted into Hindlll site on the 5.5 
Kb stufferll fragment subcloned into the pBluescript. The 6.0 Kb stufferll 

10 was recovered by the restriction digestion of AscI and ligated into K left arm 
and right arm with 10 Kb stufferl and pBluescript. 
2) Preparation of arms for cloning 

X-DNA was prepared by QIAGEN lambda Midi kit (#12543). 

The cohesive termini of 10 jxg of the lambda DNA were annealed by 

15 incubation for 2 hours at 42°C in 180 \xl of 10 mM Tris-Cl pH 7.5, lOmM 

MgCl 2 , and we added 20 \xl of 10 x Ligation buffer and 400 unit of T4 Ligase 
(both of NEB Kit), and incubated for 7 hours at room temperature, followed 
by ligase inactivated for 15 min at 65°C. The above A,-DNA was digested 
with restriction enzymes (all purchased from New England Biolabs, Inc.) in 

20 3 steps by addition of 50 mM, lOOmM and then 150mM NaCl (final 

concentration at each of the three steps). The first step restriction was done 
in 50 mM NaCl by addition of 2 jxl of 5M NaCl, 10 \xl of NEB 2 buffer, 73 [xl of 
H 2 0, 40 units of Xhol, 20 units of Spel and 32 units of Pad for both vectors 
and then the sample was incubation for 2 hours at 37°C. The second step 

25 was done in 100 mM NaCl by addition of 2 jxl of 5M NaCl, 20 yl of lOx NEB 3 
buffer, 180 \xl of H 2 0 and 20 units of Swal and incubation for 2 hours at room 
temperature. After this step the reaction tube was heated for 15 min at 
65°C. Finally, the third step was done in 150mM NaCl by addition of 5 \xl of 
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5M NaCl, 60 units of SaR and 60 units of BamHL, and incubation for 4 hours 
at 37°C. After restriction the DNA was purified by Proteinase K treatment 
in presence of 0.1% SDS and 20 mM EDTA, extracted with 
phenol/chloroform and chloroform, and precipitated with ethanol (Sambrook, 
5 et aL, 1989), DNA concentration should not exceed 20 jig/ml to avoid 

resuspension problems. After very careful resuspension for at least 30 min, 
the digested DNA was separated in 0.7% low-melting agarose gel (Seaplaque, 
FMC) in the followings steps. After electrophoresis for 1.5 hours at 8 V/cm 
the DNA fragments which was shorter than 19 Kb of the Styl-digested X 

10 DNA were cut off from the gel (step 1). Then, the electrophoresis buffer 
(lxTBE) was changed for fresh one and the remained DNA in the gel were 
electrophoresed to the opposite orientation at 8 V/cm for 2.5 hours. At this 
point the shorter DNA than 19 kb were cut off again (step2). The buffer 
was changed again. The remainder of DNA in the gel were electrophoresed 

15 to the same orientation of the step 1 at 8 V/cm for 30 min in order to compact 
the region containing the X arms DNA for shorter reaction volumes. Finally 
the X arms DNA were cut off (step 3), and purified and checked as previously 
described (Carninci and Hayashizaki, 1999, as above) with j3 -agarase 
(NEB) after equilibration of the gel with TE buffer (Sambrook et aL, 1989). 

20 3) Construction of the test insert 
250 bp test insert 

X-DNA was digested with PstI and electrophoresed in the 2 % low 
melting agarose gel. 200-300 bp bands were cut off and purified by 
QIAquick Gel Extraction Kit (Qiagen). 200-300 bp PstI fragments were 

25 subcloned into the pBluescript and digested with BamHI and Sail. 250 bp 
BamHI-Sall fragmet was separated in 2.0 % low-melting agarose gel and cut 
off and purified by Qiagen Kit. 
2kb test insert 
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The plasmid containing 2.0 Kb mouse cDNA was used as PCR 
template. 2 Kb insert was amplified with the IstBS primer and 
2ndXprimer and purified by Proteinase K treatment in presence of 0.1 % 
SDS and 20 mM EDTA, extracted with phenol/chloroform and chloroform 
and precipitated with ethanol (Sanbrook, et al., 1989, as above). PCR 
products were digested with BamHI and Xhol (cohesive ends with Sail) and 
purified as described above. 
6 Kb test insert 

6 Kb test insert was prepared as described above for the previous 

inserts. 

10 Kb test insert 

p-FLC-I with 10 Kb stufferl was digested with BamHI and Sail and 
purified by proteinase K as described above. The 10 Kb BamHI-Sall 
fragment was separated with 0.7 % low-melting agarose gel electrophoresis 
and isolated from gel with p-agarase (NEB) after equilibration of the gel 
with TE buffer (Sambrook et al, 1989) 
4) Insert size check 

4 kinds of test insert was ligated into X-FLC-I with 5.5 Kb stufferll 
and k-FLC-I with 6.0 Kb stufferll. 200 bp, 2 Kb, 6 Kb and 10 Kb test 
inserts were ligated at ratio 1:1:1:1 or 3:1:1:1 to the both vectors, respectively. 

Subsequently, the packaging reaction was performed using MaxPlax 
Lambda Packaging Extract (Epicentre Technologies). The phage solutions 
were amplified in C600 cells. lxlO 4 pfu were plated on 90 mm dishes of LB- 
agar and topped with LB-agar containing 10 mM MgS0 4 and let grow 
overnight to confluence (Sambrook et al., 1989). The phages particles were 
eluted with SM-buffer and titered. The phage DNA was extracted and 
converted to plasmid with 1 U Cre-recombinase at 37°C for 1 hour in 300 uL 
as recommended (Novagen, Madison, WI, USA), and the purified by S400 
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spun column (Pharmacia). The excised plasmids were electroporated into 
DH10B cells (Life Technologies) at 2.5 KV/cm and plated on the LB-agar 
plate containing 100 ug/ml ampicillin. Each 96 colonies were picked up and 
the plasmid preparation was performed by the plasmid extraction automatic 
5 instrument, solutions and protocols obtained by KURABO (however, any 
other method of purification of plasmid, for instance according to Sambrook 
et al.,1989, can be used). The plasmids were digested with PvuII and insert 
size was checked by agarose gel electrophoresis. 
Results are shown in Table 1. 

10 Table 1 







5.5 kb stuffer II 


6.0 kb stuffer II 


10.0 kb 


insert 


5 


3 


6.0 kb 


insert 


43 


27 


2.0 kb 


insert 


42 


50 


0.25 kb 


insert 


3 


2 



Vectors stuffer II of 5.5 kb were able in 43 cases to accept inserts of 6 
kb and in 5 cases inserts of 10 kb. The inserts of 6 and 10 kb corresponding 
to long and full-length cDNAs. 

15 The result demonstrated that vectors comprising a stuffer II of 5.5 kb, 

allowed the insertion of cDNA inserts of long sizes (6.0 and 10.0 kb) more 
efficiently than vectors comprising a stuffer II of 6.0 kb. A vector having CS 
of 37.5 kb (that is stuffer II of 5.5 kb) is advantageous for preparing full- 
length cDNAs libraries than a vector having the CS size of 30 kb (that is 

20 stuffer II of 6 kb). 

Example 19 : The gene discovery is correlated with the a verage insert size of 
the cDNA library 
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I) A vector for cloning size-selected cDNA with liga tion -m edia ted clone 
transfer: X-FLC-III-L-D (Fig. 2e) 

Similar to A.-FLC-I-L-B and X-FLC-I-L-D, X-FLC-III-L-D lacks stuffer 
II and therefore is used for cDNA libraries with large inserts. This vector 
carries the same background-reducing element as X-FLC-I-L-D, but X-FLC- 
III-L-D differs from X-FLC-I-L-D in that excision of X-FLC-III-L-D yields a 
pFLCIII-d plasmid (the plasmid of Fig. 2i comprising the stuffer I of Fig. Id), 
which is suitable for subcloning without internal cleavage of cDNAs. 

II) A vector for short cDNAs and ligation-mediated transfer of inserts: X- 
FLC-III-S-F (Fig. 2f) 

The mRNA of many organisms that are evolutionarily far from 
vertebrates, such as Arabidopsis thaliana and Oryza sativa (rice), is shorter 
(typically 1 to 1.5 kb on an agarose gel) than that of vertebrates. When 
working with invertebrates, size selection like that used in all of the 
previously described examples may bias for long inserts, which may not be 
representative of the starting mRNA. Even though gene discovery from 3 
rice libraries has been excellent even when we use A,-FLC-I-B, we prepared 
A,-FLC-III-S-F to address this concern. X-FLC-III-S-F is the same as the 
previously described X-FLC-III-F but has a longer stuffer II (6.3 kb). With 
the 6.3-kb stuffer II, the nominal cloning size is 0 to 14.9 kb, which 
facilitates cloning relatively short cDNAs. The background-reducing 
element of A.-FLC-III-S~F is that in Figure If, and this vector produces, after 
excision, a pFLCIII-f plasmid (the plasmid of Fig. 2i comprising the stuffer I 
of Fig. If). 

III) Full-length cDNAs 

The full-length cDNA we used was prepared as described (Carninci 
and Hayashizaki, 1999, as above) and was normalized/subtracted (Carninci 
et al., 2000, Genome Res., 10:1617-1630). cDNA prepared with any other 
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technique can be directionally cloned into the A.-FLC vectors, provided that 
the restriction sites are compatible or that the vector is properly modified. 

The average insert size of cDNA cloned into A,-FLC-I-B was always 
longer than that for the same cDNA cloned into other vectors (Table 2; 
5 average size of cDNA libraries using various vectors). 



T able 2 



Tissue 


Vector 


titer 


size (Kbp) 


Placenta 


k-ZAP II 


4.6xlO B 


1.3 


Placenta 


X-FLC-I-B 


1.8xl0 5 


2.34 


Cerebellum 


pBluescript 


8.6xl0 4 


1.4 


Cerebellum 


X-FLC-I-B 


3.7xl0 5 


3.36 



The average insert size of the X-FLC-I-B library was 1.8 times larger 
10 than that of the X,-ZapII library and 2.4 times larger than that of the plasmid 
cDNA library. 

We correlated the average insert size of each cDNA library in Table 3 
and Figure 4 with the complexity of the library. In fact, these libraries were 
sequenced for the gene discovery program during the construction of the full- 

15 length cDNA encyclopedia (RIKEN mouse cDNA encyclopedia, RIKEN and 
Fantom Consortium, Nature, Vol. 409: 685-690. The redundancy obtained 
by sequencing randomly picked clones and clustering clones with the same 
ends (Konno et al., 2001, as above) was compared by using 7 cDNA libraries 
cloned in A,-Zap II (conventional vector) and 9 cDNA libraries cloned in X,- 

20 FLC-I-B (Table 3). To facilitate comparing differences in the complexity of 
these libraries, we show not only the clustering data after completion of 
sequencing of a given library but also the number of clusters after the 
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available number of runs closest to 5000 sequencing passes. The 
conventional vector did not accommodate the preparation of complex, low- 
redundancy cDNA libraries from any tissue. In contrast, all of the 
normalized/subtracted cDNA libraries cloned into k-FCL-I-B showed higher 
5 complexity (average, 3392 clusters / 4826 reactions; redundancy, 1.42) than 
did normalized/subtracted libraries with the conventional vector (average, 
2089 clusters / 4773 reactions; redundancy, 2.28). Even if we cannot expect 
to know a priori the variety (or complexity) of gene expression in a given 
organ, the complexity was supposed to be very high for the pooled total 

10 "embryo 10+11" library (Table 3). However, the "embryo 13 forelimb" 

library, which is cloned in X-FCL-I-B and which covers a relatively restricted 
biological phenomenon, showed higher complexity than did the "embryo 
10+11" library, which surely contains an increased variety of genes because 
it includes many developing organs and neuronal tissues. 

15 A more direct comparison comes from the libraries made from 

embryonic stem cells (ES cells); these libraries were all prepared from the 
same starting RNA. The number of clusters after 5104 sequencing 
reactions (total number of sequenced samples) is 3068 for the X-FCL-I-B- 
cloned cDNA but just 2362 after 5160 sequencing reactions for the library in 

20 the conventional vector. That is, 31% more clusters were discovered by 
using X-FCL-I-B. The difference is even more striking after additional 
sequencing reactions : 4971 clusters were categorized after 10514 sequencing 
reactions for the k-FCL -I-B-based library and only 3795 clusters after 10492 
sequencing reactions of the conventional ZAP vector library (see Figure 14); 

25 then, 15 520 sequencing passes of the conventional ZAP vector library (48% 
more) led to only 4566 clusters (9% fewer)) Fig. 14). Notice also that 
although both the ES cell libraries were normalized and mildly subtracted 
with the same drivers, the C3 library (which was in X.-FCL-I-B) was also 
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subtracted with genes that were already categorized. Although we expected 
that a strongly subtracted library would contain a lower variety of genes, 
this was not the case. 

These data support the notion that the capacity to clone long cDNAs 
accelerates new gene discovery when full-length approaches are used. In 
addition, the introduction of the X-FGL vectors during the course of the 
preparation of the mouse cDNA encyclopedia restored a high rate of gene 
discovery (Table 3). 

Noteworthy also is the increased rate of new genes identified by 
using 5 f -end readings of X-FLC-based libraries, which suggested that 
previously available cloning protocols and vectors have biased the gene 
discovery for short cDNAs. 

The 7^-FLC vector family according to the invention demonstrated to 
be a powerful tool for high-efficiency cloning of full-length cDNA, gene 
discovery, and bulk transfer of selected cDNA clones into vectors for 
functional analysis, such as expression vectors. 
Example 20 : A,-BAC vector construction 
1) Preparation of "component 1" (Fig.9^ 

10 [xg of plasmid named pFLC-III-e were digested with 10 units of 
restriction enzyme BssHII (New England Biolabs also indicated as NEB) in 
20 id of lx supplied buffer (NEB) at 37°C for 1 hour. The pFLC-III-e/ 
BssHII was separated with TAE (Tris-acetate-EDTA buffer, Sambrook et al., 
1989) 0.8% low-melting agarose gel (SeaPlaque, FMC) at 50 V for 1 hour (see 
Sambrook et al, 1989). The plasmid band was cut out from the gel and 
digested with (3-agarase (New England Biolabs) as suggested by the 
manufacturer (alternatively, also the standard technique described in 
Sambrook et al., 1989 can be used). 

The 5 kb of stuffer I was cut out from the gel and sliced. The gel 
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was mixed with 1 ml of lx p-agarase buffer (NEB). The tube containing the 
gel was put on ice for 30 min to equilibrate with lx (3-agarase buffer. The 
buffer was removed from the tube by pipetting and put a new lx (3-agarase 
buffer. The tube was put on ice for 30 min. This buffer exchange cycle was 
5 repeated once more. The buffer was removed and the tube was incubated at 
65°C for 5 min to melt the gel. 10 unit of (3-agarase (NEB) were added to 
the tube and incubated for 5 hours. Phenol/chloroform extraction was done 
and precipitated with ethanol according to standard techniques (Sambrook 
et al., 1989). The precipitated 5 kb fragment was dissolved with 5 [xl of TE 

10 (10 mM Tris-HCl, 1 mM EDTA, pH 7.5) and indicated as "component 1" . 
2^ Preparation of "component 2" (Tig. 9) 

ApBeloBACll derivative prepared according to Fig.l of US 
5,874,259 (herein incorporated by reference) was used in the following 
"preparation of component 2" experiment. According to the description of 

15 US 5,874,259, the basic pBeloBACll (Kim et al., 1996, Genomics, 34:213- 
218) was modified by as following: ligating together the oriV element (SEQ 
ID NO:43) and the FRT element (SEQ ID NO:44) and the resulting fragment 
was made blunt and ended and then ligated into the Xhol site which had 
been made blunt end. The orientation of the two joined fragments is such 

20 that when the fragment is cloned into the Xhol site, the ori is physically 
located between the nearby FRT site and the insert cloning site. 

3 \ig of this pBeloBACll derivative (Fig.9) was cleaved with 10 U of 
the restriction enzyme Sail (NEB) in 30 \il as recommend by the 
manufacturer (37°C in the supplied buffer) and then dephosphorylated by 

25 adding 1 unit of CIP (Calf Intestinal Phosphatase) (Takara, Japan) at 37°C 
for 30 min (a general use of dephosphorylation to reduce the cloning 
background is disclosed in Sambrook et al., 1989) followed by separation 
using TAE 0.8% low-melting point agarose (SeaPlaque, FMC) at 50 V for 1 
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hour (standard technique, Sambrook et aL, 1989). 

The agarose gel region containing the plasmid fragment of 6.7 kb 
indicated in Fig.9 as "component 2" was cut out of the gel (approximately 200 
microliters) and digested with 10 units of |3-agarase (NEB) for 5 hours, 
5 extract with phoenol/chloroform and then followed by ethanol precipitation 
same as shown in component 1. 
3) Pre paration of "c o mponent 3" (Fig T 9) 

A double strand oligonucleotide "adaptor" (Fig.9) comprising the 
upper strand: 5 ? -pTCGAAGCTTCCG-3 ? (SEQ ID NO:45) phosphorylated at 
10 the 5' end and the lower strand: 5'-CGCGCGGAAGCT-3' (SEQ ID NO:46) 
was prepared using oligosynthesized using an automated synthesizer 
(EXPEDITE 8909 using the standard protocol and reagents). 
4^ Ligation of "components 1. 2 and 3" (Fig.9^ 

"Component 1" (pFLC-IH-e/fesHII fragment), "component 2" and 
15 "component 3" were mixed together in the ratio of 50 ng: 37 ng: 0.1 ng in the 
presence of lx buffer (prepared by dilution to 1/10 from a stock of lOx 
supplied by the manufacturer NEB), 400 units of T4 DNAligase (NEB) in 
final 5 jlxI of final volume reaction (buffer lx dilution, DNA, adaptor, DNA 
ligase). 

20 The mixture was incubated at 16°C overnight to complete the 

ligation reaction. 

After the addition of NaCl at 0.2 M final concentration into the 
ligation reaction, the ligation products were precipitated with 2 volumes of 
96% ethanol and 1 \xg of Glycogen (Roche) -according to the standard 
25 techniques (Sambrook et aL, 1989) and the ligated products were recovered 
by ethanol precipitation according to standard protocol (Sambrook et aL, 
1989). The ligation products were dissolved in 10 \il of H 2 0. 

ljxl of the recovered ligation products were electropotrated into 20 yl 
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of DH10B electrocomponent cells (Invitrogen) at 2.5 KV.cm (according to 
Invitrogen) instructions followed by plating the elctroporeted plasmid cells 
on LB-agar-supplemented with ampicillin at 50 \xglml. To select positive 
clone which has modified pBAC, having the construct with the desired insert 
5 ("component 1"), randomly picked clones were cultured and plasmids 

checked (see Sambrook et al for general strategy of selecting and analyzying 
recombinants plasmids). A plasmid (modified pBAC of Fig.9) having the 
stuffer I as indicated in Fig.le as insert is then selected for the next step 
5) Introduction of loxP and Xbal sites (Fig. 10) 
10 In order to introduce loxP and Xbal sites into the modified pBAC 

prepared as above, 1 \ig of the modified pBAC was mixed with 0.5 \iM. of 
"primer 1" (5'- 

AGAGAGAGAGATGTAGAATAACTTCGTATAATGTATGCTATACGAAGTTA 
TCTGTCAAACATGAGAATTG-3' )(SEQ ID NO:47), 0.5fxM of "primer 2": (5'- 
15 GAGAGAGAGATCTAGATAACTTCGTATAGCATACATTATACGAAGTTATC 
GAATTTCTGCCATTCAT-3' )(SEQ ID NO:48), 125 \iM dNTP mix, lx "GC 
buffer 1" (Takara, Japan) , 5 units of LA-Taq (Takara, Japan) in a volume of 
50 [xL. 

Then, the following PGR amplification cycle was repeated for 25 
20 times; step 1: 94°C for 5 sec; step 2: 50°C for 5 sec, 72°C for 12 min. 

After amplification, 1 \xl of 0.5M EDTA, 1 \xl of 10% SDS and 1 \il of 
proteinaseK, (10 mg/ml stock) (Sigma) were added to the PCR products 
obtained, incubated at 45°C for 15 min and followed by phenol/chloroform 
treatment, chloroform extraction and then ethanol precipitation (Sambrook 
25 et aL, 1989). After ethanol precipitation, the pellet was dissolved with 

water and cut with 15 units of restriction enzyme Xbal (NEB) in the buffer 
supplied by the manufacturer (NEB). PCR product was purified after 
electrophoretic separation with TAE 0.8% low-melting agarose gel 
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(SeaPlaque, FMC) at 50 V for 1 hour (Sambrook et al., 1989). The PCR 
product was cut and digested with 10 units of beta-agarase (NEB) as 
suggested by the manufacturer (alternatively, also the standard technology 
disclosed in Sambrook et al., 1989 can be used). 
5 The 11.7 kb of PCR product was cut out from the gel and sliced. The 

gel was mixed with 1 ml of lx |3-agarase buffer (NEB). The tube containing 
the gel was put on ice for 30 min to equibrate with lx (3-agarase buffer. The 
buffer was removed from the tube and put a new lx (3-agarase buffer. The 
tube was put on ice for 30 min. This buffer exchange cycle was repeated 

10 once more. The buffer was removed and the tube was incubated at 65°C for 
5 min to melt the gel. 10 unit of p-agarase (NEB) were added to the tube 
and incubated for 5 hours. Phenol/chloroform extraction was done and 
precipitated with ethanol following standard techniques (Sambrook et al., 
1989). The precipitated 11.7 kb fragment was dissolved with 5 [il of TE (10 

15 mM Tris-HCl, 1 mM EDTA, pH 7.5) and indicated as "component 4" (fig. 10). 
6) Preparation of stuffer II ("component 5")(Fig.ll) 

To prepare the 1.8 kb stuffer as a size balancer (also indicated as 
"stuffer IF), 3 \ig of mouse genomic DNA was digested with 20 units of 
SauSAI and lx supplied buffer (Nippon Gene, Japan) for 2 hours at 37°C in a 

20 volume of 20 \xh The digested DNA was separated with 1.2% low-melting 
agarose gel at 50 V for 2 hours with la.mbd.Si/StyL molecular marker (Nippon 
Gene, Japan). DNA fragments that migrated showing a size of about 1 1.8 
kb were cut out of the gel and sliced. The gel was mixed with 1 ml of lx (3- 
agarase buffer (NEB). The tube containing the gel was put on ice for 30 

25 min to equibrate with lx |3-agarase buffer. The buffer was removed from 
the tube and put a new lx (3 -agar a se buffer. The tube was put on ice for 30 
min. This buffer exchange cycle was repeated once more. The buffer was 
removed and the tube was incubated at 65°C for 5 min to melt the gel. 10 
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unit of [3-agarase (NEB) were added to the tube and incubated for 5 hours. 
Phenol/chloroform extraction was done and precipitated with ethanol 
following standard techniques (Sambrook et al., 1989). The precipitated 1.8 
kb stuffer II DNA was dissolved with 10 \d of TE (10 mM Tris-HCl, 1 mM 
5 EDTA, pH 7.5). 

The purified 1.8 kb DNAs (100 ng) was ligated with 10 ng 
SauSAl/Xbal adaptor comprising the upper strand: 

5 5 - GAGAGAGAGATCTAGAAAGCTCCA- 3' (SEQ ID NO:49), and the lower 
strand: 5'- GATCTGGAGCTT-3' (SEQ ID NO:50) for 16 hours at 16°C in the 

10 presence of lx ligation buffer (diluted stock as above described) and 400 

units of T4 DNA ligase (NEB) in a final volume of 5 fil. After inactivation of 
the ligase at 65°C for 5 min, the ligation products were separated by TAE 
1.2% low-melting agarose gel (SeaPlaque, FMC) at 50 V for 1 hour 
(Sambrook et al., 1989). again and 1.8 kb DNA was cut and digested with 

15 beta-agarase (NEB) as suggested by the manufacturer (alternatively, the 
technique described in Sambrook et al., 1989 can be used). 

The 1.8 kb of PGR product was cut out from the gel and sliced. The 
gel was mixed with 1 ml of lx (3-agarase buffer (NEB). The tube containing 
the gel was put on ice for 30 min to equibrate with lx (3-agarase buffer. The 

20 buffer was removed from the tube and put a new lx (3-agarase buffer. The 
tube was put on ice for 30 min. This buffer exchange cycle was repeated 
once more. The buffer was removed and the tube was incubated at 65°C for 
5 min to melt the gel. 10 unit of |3-agarase (NEB) was added to the tube 
and incubated for 5 hours. Phenol/chloroform extraction was done and 

25 precipitated with ethanol following standard techniques (Sambrook et al., 
1989). The precipitated 1.8 kb fragment was dissolved with 5 [xl of TE (10 
mM Tris-HCl, 1 mM EDTA, pH 7.5), 

The 1.8 kb of the purified DNA was amplified using 0.5 [xM Xbal 
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primer (5' -GAGAGAGAGATCTAGAAAGCTCCA-3 5 )(SEQ ID NO:49), 125 
dNTPs mix, lx GC buffer I (Takara, Japan), 5 units of LA-Taq (Takara)in a 
final volume of 50 [xl. 

For the PCR amplification of DNA, the following cycle was repeated 
5 25 times: step 1: 94°C for 5 sec; step2: 68°C for 1.5 min. 

After amplification, 1 ^1 of 0.5M EDTA, 1 ^tl of 10% SDS and 1 yd of 
proteinaseK, (10 mg/ml stock) (Qiagen) were added to the PCR products 
obtained, incubated at 45°C for 15 min and followed by phenol/chloroform 
treatment, chloroform extraction and then ethanol precipitation (Sambrook 
10 et al, 1989). After ethanol precipitation, the pellet was dissolved with 

water and cut with 15 units of restriction enzyme Xbal (NEB) in the buffer 
supplied by the manufacturer (NEB). 

PCR products/X£a/were separated with TAE 0.8% low melting point 
gel at 50V for 1 hour and cut out a 1.8 kb DNA fragment. This DNA 
15 fragment was digested with beta-agarase (NEB) as suggested by the 
manufacturer. 

The 1.8 kb of PCR product was cut out the gel and sliced. The gel 
was mixed with 1 ml of lx (3-agarase buffer (NEB). The tube containing the 
gel was put on ice for 30 min to equibrate with lx (3-agarase buffer. The 

20 buffer was removed from the tube and put a new lx (3-agarase buffer. The 
tube was put on ice for 30 min. This buffer exchange cycle was repeated 
once more. The buffer was removed and the tube was incubated at 65°C for 
5 min to melt the gel. 10 unit of (3-agarase (NEB) were added to the tube 
and incubated for 5 hours. Phenol/chloroform extraction was done and 

25 precipitated with ethanol following standard techniques (Sambrook et al., 
1989). The precipitated 1.8 kb fragment was dissolved with 5 [xl of TE (10 
mM Tris-HCl, 1 mM EDTA, pH 7.5). 

The purified PCR products/J£&aI were named "component 5" (see 
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Figure 11). 

7) Preparation of "component 6" (Fig, 12) 

The cohesive termini (cos ends) of 10 \ig of the (linear) A.-FLC-I-E 
(Fig.2a) annealed (the two complementary cos ends and the ends anneal to 
5 each other after this treatment; this increase ligation efficiency in later steps 
and simplify further procedures) by incubation for 2 hours at 42°C in 180 jxl 
of 10 mM Tris-Cl (pH 7.5), 10 mM MgCl 2 , and 20 ptl of lOx ligation buffer 
provided by NEB. 400 units of T4 DNA ligase (NEB) were added to the 
solution, and the sample was incubated for 5 hours at room temperature, 

10 followed by ligase inactivation for 15 min at 65°C. The X. DNA with the cos- 
ends ligated in the previous step was digested with 5 units of Xbal (Nippon 
Gene, Japan), lx manufacturers supplied buffer for 2 hours at 37°C in a 
volume of 50 jil. After digestion, 1 jxl of 0.5M EDTA, 1 \xL of 10% SDS and 1 
id of proteinaseK, (10 mg/ml stock) (Qiagen) were added to the DNA obtained, 

15 incubated at 45°C for 15 min and followed by phenol/chloroform treatment, 
chloroform extraction and then ethanol precipitation (Sambrook etal, 1989). 
After ethanol precipitation, the pellet was dissolved with water for 30 min 
while the tube was kept on ice, the digested DNA was separated in TAE 0.6% 
low-melting agarose gel at 50 V for 5 hours. Cos-ligated fragment (29 kbp) 

20 was cut out the gel and sliced. The gel was mixed with 1 ml of lx (3-agarase 
buffer (NEB). The tube containing the gel was put on ice for 30 min to 
equibrate with lx (3-agarase buffer. The buffer was removed from the tube 
and put a new lx (3-agarase buffer. The tube was put on ice for 30 min. 
This buffer exchange cycle was repeated once more. The buffer was 

25 removed and the tube was incubated at 65°C for 5 min to melt the gel. 10 
unit of (3-agarase (NEB) were added to the tube and incubated for 5 hours. 
Phenol/chloroform extraction was done and precipitated with ethanol 
following standard techniques (Sambrook et aL 1989). The precipitated 29 
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kb cos-ligated fragment was dissolved with 5 \xl of TE (10 mM TYis-HCl, 1 
mM EDTA, pH 7.5), named "component 6" (Fig.12). 
8) Ligation of "components 4. 5 and 6" (Fig.l2> 

The "component 4" (modified pBAC), "component 5" (stuffer) and 
5 "component 6" (arms) were mixed in the following ratio: 120 ng: 19 ng: 300 
ng, in presence of lx ligation buffer (NEB ligation buffer) and 400 units of T4 
DNAligase NEB in 5 |li1 for 16 hours at 16°C. 

After in vitro packaging ("MaxPlax™ Lambda Packaging Extract", 
EPICENTRE TECHNOLOGIES, Madison WI, US) and plating the 
10 recombinant X-phage (as described in Sambrook et al., 1989), a few hundreds 
plaques of A, phages were obtained. 

5 clones (phage plaques) were randomly selected according to the 
method described in Sambrook et al., 1989. 

The picked phage plaques were put in SM Buffer (Sambrook et al., 
15 1989) and left at room temperature for 1 hour. Then, the eluted phage 

solution was used to infect C600 cells and were amplified according to the 
standard protocol (Sambrook et al., 1989). 

In 3 out 5 clones we obtained the desired inserts (corresponding to 
"component 1") by analysis with restriction enzymes (Xbal+BamHI+Sall, 
20 Xbal+BamHI, Xbal+Sall) (Sambrook et al., 1989). One of this clone, named 
X-FLC-III-pBAC (Fig.12) shown the same cloning range of other described X- 
vectors (for example, A-FLC-I-B, A-FLC-II-C, X-FLC-III-F) which was 0.2- 
15.4 kb. 
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CLAIMS 

1. A cloning bacteriophage vector comprising a construction segment 
(CS) and a replaceable segment (RS), wherein the size of CS is: X-1.2 kb =s CS 

< X; wherein X corresponding to the minimum size necessary to the vector 
5 for undergoing packaging. 

2. The cloning vector of claim 1, wherein the size of CS is: X-0.2 kb. 

3. A cloning bacteriophage vector comprising a construction segment 
(CS) and a replaceable segment (RS), wherein the size of CS is: 36.5 kb <; CS 

< 38 kb. 

10 4. The cloning vector of claim 3, wherein CS is 37.5 kb. 

5. The cloning vector of claim 4, wherein CS is or comprises a foreign 
segment of 5.5 kb. 

6. The cloning vector of claims 1-5, wherein said bacteriophage is X. 

7. The cloning vector of claims 1-6, wherein CS is a bacteriophage 
15 vector segment modified by comprising a plasmid segment at least 

comprising a ori. 

8. The cloning vector of claim 7, wherein said plasmid segment 
comprising a ori is selected from the group of: pBluescript (+), pUC, pBR322, 
and pBAC. 

20 9. The cloning vector of claims 1-8, wherein CS further comprises at 
least a selectable marker selected from the group consisting of: a DNA 
segment that encodes a product that provides resistance against otherwise 
toxic compounds; a DNA segment that encodes a product that suppresses the 
activity of a gene product; a DNA segment that encodes a product that is 

25 identifiable; a DNA segment that encodes a product that inhibits a cell 
function; a DNA segment that provides for the isolation of a desired 
molecule; a DNA segment that encodes a specific nucleotide recognition 
sequence which is recognized by an enzyme. 
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10. The cloning vector of claim 9, wherein said selectable marker 
comprises at least a marker selected from the group consisting of an 
antibiotic resistance gene, an auxotrophic marker, a toxic gene, a phenotypic 
marker, an enzyme cleavage site, a protein binding site; and a sequence 

5 complementary to a PGR primer sequence. 

11. The cloning vector of claims 1-10, wherein said RS is flanked by two 
recombination sites, and said two recombination sites do not recombine with 
each other. 

12. The cloning vector of claim 11, wherein said two recombination sites 
10 are selected from the group consisting of attB, attP, attL, attR and 

derivatives thereof. 

13. The cloning vector of claim 11, wherein said two recombination sites 
flanking RS are lox recombination sites, which do not recombine with each 
other. 

15 14. The cloning vector of claims 1-13, wherein CS further comprising two 
lox recombinant sites, said two lox recombination sites being capable of 
recombine with each other. 

15. The cloning vector of claims 13-14, wherein the recombinant sites are 
loxP sites or derivatives thereof. 
20 16. The cloning vector of claims 1-15, wherein RS further comprising at 
least a background-reducing sequence. 

17. The cloning vector of claim 16, wherein said at least a background- 
reducing sequence is selected from the group consisting of: i) the ccdB gene, 
ii) the lacZ gene, hi) a lox sequence. 
25 18. The cloning vector of claim 17, wherein said hi) lox sequence is loxP 
or a derivative thereof. 

19. The cloning vector of claims 1-18, wherein RS is flanked by i) two 
homing endonuclease asymmetric recognition site sequences, which do not 
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ligate with each other; or ii) two restriction asymmetric endonuclease 
cleavage sites sequences, which do not ligate with each other, recognizable 
by class IIS restriction enzymes. 

20. The cloning vector of claim 19, wherein said homing endonuclease is 
5 selected from the group consisting of: I-Ceul, Pl-Scel, PI-PspI, and I-Scel. 

21. The cloning vector of claim 20, wherein said homing endonuclease 
asymmetric recognition site sequences are sequences from 18 to 39 bp. 

22. The cloning vector of claims 1-21, which is linear. 

23. The cloning vector of claim 1-22, wherein RS is replaced by a nucleic 
10 acid insert of interest. 

24. The cloning vector of claim 23, wherein said insert is selected from 
the group consisting of DNA, cDNAand RNA/DNA hybrid. 

25. The cloning vector of claim 23, wherein said insert is a long cDNA. 

26. The cloning vector of claim 23, wherein said insert is a full-length 
15 cDNA. 

27. The cloning vector of claim 26, wherein said full-length cDNA is a 
normalized and/or subtracted full-length cDNA. 

28. A method for cloning a nucleic acid insert of interest or for preparing 
a bulk nucleic acid library of interest, comprising the steps of: 

20 (a) preparing at least a cloning vector according to claims 1-22; 

(b) replacing RS with a nucleic acid insert of interest into the cloning 
vector obtaining the product according to claims 23-27; 

(c) allowing the in vivo or in vitro excision of the nucleic acid insert of 
interest or of the plasmid comprising the nucleic acid insert of 

25 interest; 

(d) recovering the (recombinant) plasmid carrying the nucleic acid 
insert of interest or a library of these plasmids. 

29. The method of claim 28, wherein between step b) and c) a step of 
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amplification of the cloning vector is carried out. 

30. A bacteriophage cloning vector comprising a construction segment 
(CS) and a replaceable segment (RS), wherein said RS comprises at least the 
ccdB gene. 

5 31. A bacteriophage or plasmid cloning vector comprising a construction 
segment (CS) and a replaceable segment (RS), wherein said RS comprises at 
least a recombination site or a derivative thereof; or RS is flanked by two 
asymmetric site sequences, which do not ligate with each other, and are 
recognized by restriction endonucleases. 

32. The cloning vector of claims 30-31, wherein said bacteriophage is k. 

33. The cloning vector of claims 30-32, wherein the size of the 
bacteriophage vector CS is: 32 kb =s CS <; 45 kb. 

34. The cloning vector of claims 30-32, wherein CS is: 36.5 kb ^ CS < 38 
kb. 

35. The cloning vector of claim 34, wherein CS is 37.5 kb. 

36. The cloning vector of claim 31, wherein said recombination site is lox 
recombination site or a derivative thereof. 

37. The cloning vector of claim 36, wherein said lox site is a loxP site or 
derivatives thereof. 

38. The cloning vector of claims 30-37, wherein the CS of said vector 
comprises a plasmid segment at least comprising an ori. 

39. The cloning vector of claim 38, wherein said plasmid segment 
comprising an ori is selected from the group consisting of :pBluescript(-K), 
pUC, pBR322 and pBAC. 

40. The cloning vector of claims 30-39, wherein CS further comprises at 
least a selectable marker selected from the group consisting of: a DNA 
segment that encodes a product that provides resistance against otherwise 
toxic compounds; a DNA segment that encodes a product that suppresses the 
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activity of a gene product; a DNA segment that encodes a product that is 
identifiable; a DNA segment that encodes a product that inhibits a cell 
function; a DNA segment that provides for the isolation of a desired 
molecule; a DNA segment that encodes a specific nucleotide recognition 
5 sequence which is recognized by an enzyme. 

41. The cloning vector of claim 40, wherein said selectable marker 
comprises at least a marker selected from the group consisting of an 
antibiotic resistance gene, an auxotrophic marker, a toxic gene, a phenotypic 
marker, an enzyme cleavage site, a protein binding site; and a sequence 

10 complementary to a PGR primer sequence. 

42. The cloning vector of claims 30-41, wherein said RS is flanked by two 
recombination sites, and said recombination sites do not recombine with 
each other. 

43. The cloning vector of claim 42, wherein said recombination sites are 
15 selected from the group consisting of attB, attP, attL, attR, and derivatives 

thereof. 

44. The cloning vector of claim 42, wherein said two recombination sites 
flanking RS are lox recombination sites or derivatives thereof and do not 
recombine with each other. 

20 45. The cloning vector of claim 44, wherein the lox recombination site is 
loxP or a derivative thereof. 

46. The cloning vector of claims 30-45, wherein CS further comprising 
two recombinant sites or derivatives thereof, these two recombination sites 
being capable of recombine with each other. 
25 47. The cloning vector of claim 46, wherein said two recombination sites 
are lox recombination sites or derivatives thereof. 

48. The cloning vector of claim 47, wherein said lox recombination site is 
loxP or a derivative thereof. 
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49. The cloning vector of claim 30-48, wherein said RS further comprises 
the lacZ gene. 

50. The cloning vector of claims 31-49, wherein said asymmetric site 
sequences are i) two homing endonuclease asymmetric site sequences or ii) 

5 two restriction endonuclease cleavage sites sequences recognizable by class 
IIS restriction enzymes. 

51. The cloning vector of claim 50, wherein said restriction homing 
endonuclease capable of cutting said asymmetric site sequences is selected 
from the group consisting of: I-Ceul, Pl-Scel, PI-PspI and I-Scel. 

10 52. The cloning vector of claims 50-51, wherein said homing 

endonuclease asymmetric recognition site sequences are sequences from 18 
to 39 bp. 

53. The cloning vector of claims 30-52, which is linear. 

54. The cloning vector of claim 30-53, wherein RS is replaced by a 
15 nucleic acid insert of interest. 

55. The cloning vector of claim 54, wherein said insert is selected from 
the group consisting of DNA, cDNA and RNA/DNA hybrid. 

56. The cloning vector of claim 54, wherein said insert is a long cDNA. 

57. The cloning vector of claim 54, wherein said insert is a full-length 
20 cDNA. 

58. The cloning vector of claim 57, wherein said full-length cDNAis a 
normalized and/or subtracted full-length cDNA. 

59. A method for cloning a nucleic acid insert of interest or for preparing 
a bulk nucleic acid library of interest, comprising the steps of: 

25 (a) preparing at least a bacteriophage cloning vector comprising a 

construction segment (CS) and a replaceable segment (RS), said 
RS comprising the ccdB gene; 
(b) replacing RS with a nucleic acid insert of interest into the cloning 
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vector; 

(c) allowing the in vivo or in vitro excision of the nucleic acid insert of 
interest or of the plasmid comprising the nucleic acid insert of 
interest; 

5 (d) recovering the (recombinant) plasmid carrying the nucleic acid 

insert of interest and lacking the ccdB gene or a library of these 
plasmids. 

60. The method of claim 59, wherein between the steps b) and c) an 
amplification step of the at least a cloning vector is carried out. 
10 61. A method for cloning a nucleic acid of interest or a bulk nucleic acid 
library of interest, comprising the step of: 

(a) preparing at least a cloning vector according to claims 30-53, 
wherein RS is flanked by two recombination sites, and said two 
recombination sites do not recombine with each other; 
15 (b) replacing RS with a nucleic acid insert of interest into the cloning 

vector obtaining a product according to claims 54-58; 

(c) allowing the in vitro excision of the nucleic acid insert of interest 
by providing to the cloning vector of step b) at least a destination 
vector comprising a destination replaceable segment (RS) flanked 

20 by two recombination sites, said two recombination sites do not 

recombine with each other, and said destination RS comprises at 
least the ccdB gene; 

(d) recovering a recombinant plasmid carrying the nucleic acid insert 
of interest and lacking of the ccdB gene or a library of said 

25 plasmids. 

62. The method of claim 61, wherein between the steps b) and c) an 
amplification step of the at least a plasmid is carried out. 

63. The method of claims 61-62, wherein said two recombination sites of 
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both the cloning vector of step a) and the destination vector of step d) are 
derived from recombination site selected from the group consisting of attB, 
attP, attL, and attR or derivatives thereof. 

64. The method of claims 61-62, wherein said recombination sites 

5 flanking RS are lox recombination sites or derivatives thereof, and do not 
recombine with each other. 

65. The method of claim 64, wherein said lox recombination sites are 
loxP or derivatives thereof. 

66. A method for cloning a nucleic acid insert of interest or for preparing 
10 a bulk nucleic acid library of interest, comprising the steps of: 

(a) preparing at least a cloning vector comprising a construction 
segment (CS) and a replaceable segment (RS), said CS comprising 
two recombination sites which recombine with each other, and 
said RS comprising a recombination site capable of recombining 

15 with one of the two sites placed into CS; 

(b) replacing RS with a nucleic acid insert of interest into the cloning 
vector of step a); 

(c) allowing the in vivo or in vitro excision of the nucleic acid insert of 
interest or of the plasmid comprising the nucleic acid insert of 

20 interest; 

(d) recovering the (recombinant) plasmid carrying the nucleic acid 
insert of interest or a library of said plasmids. 

67. The method of claim 66, wherein said RS and CS recombination sites 
are lox recombination site or derivatives thereof 

25 68. The method of claim 67, wherein said lox site is a loxP site or 
derivatives thereof. 

69. A method for cloning a nucleic acid insert of interest or for preparing 
a bulk nucleic acid library of interest, comprising the steps of: 
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(a) preparing at least a cloning vector comprising a construction 
segment (CS) and a replaceable segment (RS), said RS being 
flanked by two endonuclease asymmetric recognition site 
sequences, which do not ligate with each other; 
5 (b) replacing RS with a nucleic acid insert of interest comprising two 

endonuclease asymmetric recognition site sequences flanking said 
insert of interest, said sequences being capable of ligating with 
the two sequences placed into the vector of step a), and obtaining 
a vector comprising the nucleic acid insert of interest; 
10 (c) allowing the in vivo or in vitro excision of the nucleic acid insert of 

interest or of the plasmid comprising the nucleic acid insert of 
interest; 

(d) recovering the (recombinant) excised plasmid or destination 

plasmid carrying the nucleic acid insert of interest or a library of 
15 said plasmids. 

70. The method of claim 69, wherein said endonuclease asymmetric 
recognition site sequences are: i) two homing endonuclease asymmetric 
recognition site sequences; or ii) two asymmetric restriction endonuclease 
cleavage site sequences recognizable by class IIS restriction enzymes. 
20 71. The method of claim 70, wherein said restriction homing 

endonucleases capable of cutting said asymmetric site sequences are selected 
from the group consisting of: I-Ceul, Pl-Scei, PI-PspI and I-Scel. 
72. The method of claims 70, wherein said homing endonuclease 
asymmetric site sequences are from 18 to 39 bp. 
25 73. A method for cloning a nucleic acid insert of interest or preparing a 
bulk nucleic acid library of interest comprising the steps of: 

(a) preparing at least a cloning vector, comprising a construction 
segment (CS) and a replaceable segment (RS), wherein said CS is 
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a bacteriophage vector comprising two lox recombination sites or 
derivatives thereof; 
(b) replacing RS with a nucleic acid insert of interest into the cloning 
vector; 

5 (c) packaging of the vector; 

(d) in vivo in liquid-phase infection of at least a cell expressing Cre- 
recombinase; 

(e) allowing the in vivo in liquid-phase excision of at least a plasmid 
comprising the nucleic acid insert of interest under condition of 

10 short-time growth or no growth of the excised plasmid; 

(ii) CO carrying out cellular lysis and recovery of the plasmid carrying the 
insert or of a library of said plasmids. 
74. The method of claim 63, further comprising the step of: 

g) electroporating or transforming at least a cell, not expressing Cre- 
15 recombinase, making the plasmid(s) of step f) penetrating into 

said cell(s); 

h) plating of cell(s) infected as at step g) and recovering the plasmid 
carrying the nucleic acid insert of interest or a library of said 
plasmids. 

20 75. The method of claims 72-74, wherein said bacteriophage is X. 

76. The method of claim 73, wherein said lox recombination sites are 
loxP or derivatives thereof. 

77. The method of claims 73-76, wherein between the steps c) and d) an 
amplification of the packaged vector(s) is carried out. 

25 78. The method of claims 73-77, wherein the cloning vector of step a) is a 
cloning vector according to claims 1-22 or 30-53, and the product of step b) is 
a vector comprising the insert of interest according to claims 23-27 or 54-58. 
79. The method of claim 73, wherein the step e) is carried out in 0-3 
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hours at the temperature 20-45 ° C. 

80. A method for cloning a nucleic acid insert of interest or for preparing 
a bulk nucleic acid library of interest comprising the step of: 

(a) preparing at least a cloning vector, comprising a construction 

5 segment (CS) and a replaceable segment (RS), wherein said CS is 

a bacteriophage vector segment comprising two lox recombination 
sites or derivatives thereof positioned at left and right side of said 
RS; 

(b) replacing RS with a nucleic acid insert of interest into the cloning 
10 vector; 

(c) in Prepackaging of the at least a bacteriophage cloning vector of 
step b) in presence of packaging extract; 

(d) extraction of bacteriophage cloning vector from the capside; 

(e) in vitro excision of the plasmid comprising the nucleic acid insert 
15 of interest from the vector in presence of Cre-recombinase; 

(f) recovery of said plasmid or library of plasmids. 

81. The method of claim 80, further comprising the step: 

(g) electroporating or transforming at least a cell, not expressing Cre- 
recombinase, making said plasmid entering into said cell; 

20 (h) plating the cell of step g) and recovering plasmid carrying the 

nucleic acid insert of interest or a library of said plasmids. 

82. The method of claims 80-81, wherein between the steps c) and d), an 
amplification step on plate of the bacteriophage is carried out. 

83. The method of claims 80-82, wherein the lox recombination sites are 
25 loxP or derivatives thereof. 

84. The method of claims 80-83, wherein said bacteriophage is k. 

85. The method of claims 80-84, wherein the cloning vector of step a) is a 
cloning vector according to claims 1-22 or 30-53 and the insert of interest of 
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step b) is according to claims 23-27 or 54-58. 

86. A bacteriophage cloning vector comprising a construction segment 
(CS) and a replaceable segment (RS), wherein said RS is flanked by two 
recombination sites, and said two recombinant sites do not recombine with 

5 each other. 

87. The cloning bacteriophage vector of claim 86, wherein said 
bacteriophage is X. 

88. The cloning vector of claims 86-87, wherein said recombination sites 
are selected from the group consisting of attB, attP, attL, attR and 

10 derivatives thereof. 

89. The cloning vector of claims 86-88, wherein CS further comprises two 
lox recombination sites or derivatives thereof, said lox sites being capable of 
recombining with each other. 

90. The cloning vector of claim 89, wherein said lox recombination sites 
15 are loxP or derivatives thereof. 

91. The cloning vector of claims 86-90, wherein the size of the 
bacteriophage X vector segment (CS) is: 32 kb <; CS ss 45 kb. 

92. The cloning vector of claim 91, wherein CS is: 36.5 kb ^ CS < 38 kb. 

93. The cloning vector of claim 91, wherein CS is 37.5 kb. 

20 94. The cloning vector of claims 86-93, wherein the bacteriophage CS 
comprises a plasmid segment at least comprising an ori. 
95. The cloning vector of claim 94, wherein said plasmid segment 
comprising an ori is selected from the group consisting of: pBluescript(+), 
pUC, pBR322 and pBAC. 

25 96. The cloning vector of claims 86-95, wherein CS further comprises at 
least a selectable marker selected from the group consisting of: a DNA 
segment that encodes a product that provides resistance against otherwise 
toxic compounds; a DNA segment that encodes a product that suppresses the 
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activity of a gene product; a DNA segment that encodes a product that is 
identifiable; a DNA segment that binds a product that modifies a substrate; 
a DNA segment that provides for the isolation of a desired molecule; a DNA 
segment that encodes a specific nucleotide recognition sequence which is 
5 recognized by an enzyme. 

97. The cloning vector of claim 96, wherein said selectable marker 
comprises at least a marker selected from the group consisting of an 
antibiotic resistance gene, an auxotrophic marker, a toxic gene, a phenotypic 
marker, an enzyme cleavage site, a protein binding site; and a sequence 

10 complementary to a PGR primer sequence. 

98. The cloning vector of claims 86-97, wherein RS further comprising at 
least a background-reducing sequence selected from the group consisting of: 
i) the ccdB gene, ii) the lacZ gene, iii) a lox sequence. 

99. The cloning vector of claim 98, wherein said lox sequence is loxP. 
15 100. The cloning vector of claims 86-99, wherein RS is flanked by i) two 

homing endonuclease asymmetric recognition site sequences, which do not 
ligate with each other; or ii) two asymmetric restriction endonuclease 
cleavage sites sequences recognizable by class IIS restriction enzymes. 

101. The cloning vector of claim 100, wherein said homing endonucleases 
20 capable of cutting said asymmetric site sequences are selected from the 

group consisting of: I-Ceul, Pl-Scel, PI-PspI and I-Scel. 

102. The cloning vector of claims 100-101, wherein said homing 
endonuclease asymmetric site sequences are sequences from 18 to 39 bp. 

103. The cloning vector of claims 86-102, which is linear. 

.25 104. The cloning vector of claims 86-103, wherein RS is replaced by a 
nucleic acid insert of interest. 

105. The cloning vector of claim 10, wherein said insert is selected from 
the group consisting of DNA, cDNA, RNA/DNA hybrid. 
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106. The cloning vector of claim 104, wherein said insert is a long cDNA. 

107. The cloning vector of claim 104, wherein said insert is a full-length 
cDNA. 

108. The cloning vector of claim 107, wherein said full-length cDNA is a 
5 normalized and/or subtracted full-length cDNA. 

109. A method for cloning a nucleic acid insert of interest or for preparing 
a bulk nucleic acid library of interest, comprising the steps of: 

(a) preparing at least a cloning vector comprising a construction 
segment (CS) and a replaceable segment (RS), wherein said CS is 

10 a bacteriophage vector segment and RS is flanked by two 

recombination sites, and said two recombinant sites do not 
recombine with each other; 

(b) replacing said RS with a nucleic acid insert and obtaining the 
product of claims 105-108; 

15 (c) in vitro packaging the at least a bacteriophage cloning vector of 

step b); 

(d) allowing the in vitro excision of the nucleic acid insert(s) of 
interest by providing to the at least a cloning vector of step c) an 
at least a destination vector comprising a destination replaceable 

20 segment (RS) flanked by two recombination sites, and said two 

recombination sites do not recombine with each other; 

(e) recovering a recombinant plasmid carrying the nucleic acid insert 
of interest or a library of said plamids. 

110. The method of claim 109, wherein said bacteriophage is k. 

25 111. The method of claims 109-110, wherein said two recombination sites 
of both the cloning vector of step a) and the destination vector of step d) are 
derived from recombination sites selected from the group consisting of attB, 
attP, attL, attR and derivatives thereof. 
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112. The method of claims 109-111, wherein said two recombinant sites of 
both step a) and step d) are lox recombination sites or derivatives thereof, 
which do not recombine each other. 

113. The method of claim 112, wherein said lox recombination site is loxP 
5 or derivative thereof. 

114. The method of claims 109-113, wherein said RS of the destination 
vector of step d) further comprises at least the ccdB gene 

115. The method of claims 109-114, wherein the CS of the vector cloning 
further comprises a selectable marker. 

116. The method of claims 109-115, further comprising the steps of: 
(f) providing an at least a second destination vector comprising a 

destination replaceable segment (RS) flanked by two 
recombination sites, and said two recombination sites do not 
recombine with each other, in contact with the plasmid(s) of step 
(e). 

117. The method of claims 109-116, further comprising a step of 1) 
electroporating at least a cell making the plasmid obtained in step e) or f) 
entering said cell; and 2) plating the cell of step 1) and recovering of the 
plasmid or plasmids carrying the insert 

118. A kit comprising at least a cloning vector or at least a library of 
vectors according to claims 1-27, 30-58, or 86-108. 

119. A method for preparing at least one normalized and/or subtracted 
library comprising the steps of: 

(a) providing at least an excised plasmid or a destination plasmid 
prepared according to claims 28-29, 59-85 or 109-117; 

(b) providing the plasmid of step b) to a pool of nucleic acid targets; 

(c) removing the hybrids; 

(d) collected the normalized and/or subtracted nucleic acid targets. 
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120. The method of claim 119, wherein the plasmid of step b) is treating 
by 1) making at least a nick into only one strand of the double stranded 
plasmid(s); 2) removing the plasmid fragments which have been nicked; 3) 
collecting the single strand(s) which has not been nicked; 4) applying the 

5 steps (c)-(d). 

121. The method of claim 120, wherein the nick is introduced by using the 
Genell protein. 

122. The method of claim 120, wherein the strand which has been nicked 
is removed by an esonuclease. 

10 123. The method of claim 122, wherein the esonuclease is ExoIII. 

124. A method for preparing at least a normalized and/or subtracted 
library comprising the steps of: 

(a) providing at least a vector of claims 1-22, 30-53 or 86-108, 
wherein the CS of the vector comprises a Fl ori; 
15 (b) replacing RS with a nucleic acid insert of interest according to 

claims 23-27, 54-58 or 86-108; 

(c) adding an helper phage and producing a number of a single 
strand plasmid vector copies; 

(d) providing the copies of step c) to a pool of nucleic acids targets; 
20 (e) removing the hybrids; 

(f) collected the normalized and/or subtracted nucleic acid targets. 

125. A bacteriophage vector comprising a bacterial artificial chromosome 
(pBAC) or a segment thereof comprising at least an origin of replication (ori). 

126. The bacteriophage of claim 125, wherein the bacteriophage is 
25 X bacteriophage. 

127. The bacteriophage of claim 125-126, wherein the pBAC or segment 
thereof further comprises: 

- a site into which an DNA fragment can be cloned; 
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- at least one pair of inducible excision-mediating sites flanking the site into 
which the DNA fragment can be cloned, the excision-mediating sites defining 
an excisable fragment that comprises the site into which the DNA fragment 
can be cloned. 

5 128. The bacteriophage of claim 127, wherein the pair of excision- 
mediating sites are FRT sites. 

129. The bacteriophage of claim 127, wherein the pair of excision- 
mediating sites comprise a sequence as shown in SEQ ID NO:45. 

130. The bacteriophage of claims 125-129, wherein the ori is an ori 
10 capable of maintaining the plasmid at single copy. 

131. The bacteriophage of claims 125-130, wherein the pBAC or segment 
thereof further comprises an inducible origin of replication. 

132. The bacteriophage of claim 131, wherein the inducible origin of 
replication is oriV. 

15 133. The bacteriophage of claims 125-126, comprising a bacterial artificial 
chromosome (pBAC) or a segment thereof comprising an inducible origin of 
replication. 

134. The bacteriophage of claims 125-133, comprising at least two 
recombination sites selected from the following: (a) two recombination sites, 

20 wherein either site does not recombine with the other; (b) two lox 

recombination sites, wherein either site is capable of recombining with each 
other; (c) two homing endonuclease asymmetric recognition site sequences; 
(d) two restriction asymmetric endonuclease cleavage site sequences, 
wherein either site sequence does ligate with the other, recognizable by class 

25 IIS restriction enzymes. 

135. The bacteriophage of claim 134, wherein the two recombination sites 
(a) are selected from the group consisting of attB, attP, attL, attR and 
derivatives thereof. 
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136. The bacteriophage of claim 134, wherein the two recombination sites 
(a) are lox recombination sites derivative, which do not recombine with each 
other. 

137. The bacteriophage of claim 134, wherein the two recombination sites 
5 (b) are loxP sites. 

138. The bacteriophage of claim 134, wherein the two homing 
endonuclease site sequences (c) are selected from the group consisting of: I- 
Ceul, Pl-Scel, PI-PspI, and I-Scel. 

139. The bacteriophage of claims 125-138, further comprising at least a 
10 background-reducing sequence. 

140. The bacteriophage of claims 139, wherein the at least background- 
reducing sequence is selected from: a) the ccdB gene; b) the lacZ gene; c) a 
lox sequence. 

141. A method for cloning a nucleic acid of interest or for preparing a bulk 
15 nucleic acid library of interest comprising the steps of: 

(a) preparing a bacteriophage cloning vector according to claims 
125-140; 

(b) inserting a nucleic acid of interest into the bacteriophage 
cloning vector; 

20 (c) allowing the in vivo or in vitro excision of the BAG plasmid 

comprising the nucleic acid insert of interest; 
(d) recovering the BAC plasmid carrying the nucleic acid insert of 
interest or a library of these BAC plasmids. 
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Fig. 2 
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Fig. 5 
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Fig. 6 
Homing endonuclease 
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Fig. 9 
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Fig. 10 
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Fig. 12 
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Fig. 13 



pFLCII ; modified pBluescr iptl I SK(+) 

CTAAATTGTAAGC6TTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTC 

ATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGA 

GATAGGGTTGAGTGTTGTTGCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTC 

CAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACC 

CTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCAOTAAATCGGAACCCTAAAGGGAG 

CCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAA 

AGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCAC 

CACACCCGGCGCGCTTAATGCGCCGCTAGAGGGCGCGTCCCATTCGCCATTCAGGGTGCG 

CAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGGTATTACGCCAGCTGGCGAAAGG 

GGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTG 

TAAAACGACGGCCAGTGAGCGCGCGTAATACGACTCACTATAGGGCGAATTGGGTACGGG 
GC 

C ACAAGTTTGTACAAAAAAGCAGGCT OTftfiAfi GGATC CACCGAGCTTTCTTGTAC 

AttB1 Xhol BamHI AttB2 

MAGTGGTCTAGACCTCTCTTGGCCG CATAACTTCGTATAGCATACATTATACGAAGTTAT 

!oxP 

GCGGCCGCCACCGCGGTGGAGCTCCAGCTTTTGTTGCGTTTAGT 

GAGGGTTAATTGCGCGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTT 

ATCCGCTCACAATTCCACACAAGATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTG 

CCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGG 

GAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAAGGCGCGGGGAGAGGCGGTTTGC 

GTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGC 

GGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATA 

ACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCG 

CGTTGCTGGCGTTTTTCGATAGGCTCCGCCCCCCTGACGAGCATGACAAAAATCGACGCT 

CAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATAGCAGGCGTTTCCCCGTGGAA 

GCTCCCTCGTGCGCTCTCCTGTTCCGAGCCTGCCGCTTACCGGATACCTGTCCGCCTTTC 

TCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGT 

AGGTCGTTCGCTCCAAGGTGGGCTGTGTGCACGAAGCCCCCGTTCAGCCCGACCGCTGCG 

CCTTATCCGGTAACTATGGTCTTGAGTCCAACGCGGTAAGACACGACTTATCGCCACTGG 

CAGCAGGCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCT 

TGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGC6CTCTGC 

TGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCG 

CTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTC 

AAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTT 

AAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAA 

AATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTGTGACAGTTACCAAT 

GCTTAATCAGTGAGGGACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCT 

GACTCCCCGTCGTGTAGATAACTACGATAGGGGAGGGCTTACCATCTGGCGCCAGTGCTG 

CAATGATACCGCGAGACCCACGCTCACGGGCTGCAGATTTATCAGCAATAAACGAGCCAG 

GCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTGTATTA 

ATTGTTGCCGGGAAGGTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTG 

CCATTGCTACAGGGATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCG 

GTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCT 

CCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTA 

TGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTG 

GTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCC 

CGGCGTCAATACGGGATAATACCGCGCCAGATAGGAGAACTTTAAAAGTGCTCATCATTG 

GAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACGGCTGTTGAGATCCAGTTCGA 

TGTAACCCAGTCGTGCACCCAAGTGATGTTCAGCATCTTTTACTTTCACCAGCGTTTCTG 

GGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAAT 

GTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTC 

TCATGAGGGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCA 
CATTTCCCGGAAAAGTGCCAC 
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SEQUENCE LISTING 

<110> RIKEN (The Institute of Physical and Chemical Research) 

<120> Cloning vectors and method for molecular cloning 

<130> YCT-692 

<140> 
<141> 

<160> 38 

<170> Patentln Ver. 2. 1 

<210> 1 
<211> 146 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer_bind 
<222> (1).. (18) 

<223> Forward (Fwd) primer binding site 
<220> 

<221> promoter 
<222> (25).. (44) 

<223> T7 polymerase binding site 
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<220> 

<221> misc_recomb 
<222> (76).. (109) 

<223> loxP recombination site recognized by 
Cre-recombinase 

<220> 

<221> proteinjnnd 
<222> (116).. (128) 
<223> restriction site Sfil 

<220> 

<221> protein_bind 
<222> (141).. (146) 
<223> restriction site Sail 

<220> 

<223> Description of Artificial Sequence: polylinker of 
the left arm of vector pFLC-I (fig. 2g) 

<400> 1 

tgtaaaacga cggccagtga attgtaatac gactcactat agggcgaatt ggagctccac 60 

cgcggtggcg gccgcataac ttcgtatagc atacattata cgaagttatg gatcaggcca 120 

aatcggccga gctcgaattc gtcgac 146 



<210> 2 
<211> 125 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<221> protein_bind 
<222> (1).. (6) 

<223> BamHI restriction site 
<220> 

<221> proteinjind 

<222> (7).. (19) 

<223> Sfil restriction site 

<220> 

<221> promoter 
<222> (57).. (76) 

<223> T3 polymerase binding site 
<220> 

<221> primer_bind 
<222> (109).. (125) 

<223> Reverse (Rev) primer binding site 
<220> 

<223> Description of Artificial Sequence: polylinker of 
the right arm of vector pFLC-I (fig. 2g) 

<400> 2 

ggatccggcc ataagggcct gatccttcga gggggggccc ggtaccagct tttgttccct 60 
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ttagtgaggg ttaatttcga gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa 120 
ttgtt 125 



<210> 3 
<211> 102 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primerjnnd 
<222> (1).. (18) 

<223> Forward (Fwd) primer binding site 
<220> 

<221> promoter 
<222> (28).. (47) 

<223> T7 polymerase binding site 
<220> 

<221> misc_recomb 

<222> (66).. (90) 

<223> attBl recombination site 

<220> 

<221> protein_bind 

<222> (91).. (96) 

<223> Xhol restriction site 
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<220> 

<221> protein_Mnd 

<222> (97). . (102) 

<223> Sail restriction site 

<220> 

<223> Description of Artificial Sequence: polylinker of 
the left arm of vector pFLC-II (fig. 2h) 

<400> 3 

tgtaaaacga cggccagtga gcgcgcgtaa tacgactcac tatagggcga attgggtacc 60 
gggccacaag tttgtacaaa aaagcaggct ctcgaggtcg ac 102 

<210> 4 
<211> 188 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> protein_bind 
<222> (1).. (6) 

<223> BamHI restriction site 
<220> 

<221> misc_recomb 
<222> (7).. (30) 

<223> at tB2 recombination site 
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<220> 

<221> misc__recomb 
<222> (51).. (84) 

<223> loxP recombination site recognized by 
Cre-recombinase 

<220> 

<221> promoter 

<222> (120).. (139) 

<223> T3 polymerase binding site 

<220> 

<221> primer_bind 
<222> (172).. (188) 

<223> Reverse (Rev) primer binding site 
<220> 

<223> Description of Artificial Sequence: polylinker of 
the right arm of vector pFLC-II (fig. 2h) 

<400> 4 

ggatccaccc agctttcttg tacaaagtgg tctagacctc tcttggccgc ataacttcgt 60 

atagcataca ttatacgaag ttatgcggcc gccaccgcgg tggagctcca gcttttgttc 120 

cctttagtga gggttaattg cgcgcttggc gtaatcatgg tcatagctgt ttcctgtgtg 180 

aaattgtt 188 



<210> 5 
<211> 93 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer_bind 
<222> (1).. (18) 

<223> Forward (Fwd) primer binding site 
<220> 

<221> promoter 
<222> (30).. (49) 

<223> T3 polymerase binding site 
<220> 

<221> protein_bind 
<222> (62).. (87) 

<223> I-Ceul homing endonuclease recognition site 
<220> 

<221> protein_bind 

<222> (88).. (93) 

<223> Sail restriciton site 

<220> 

<223> Description of Artificial Sequence: polylinker of 
the left arm of vector pFLC-III (fig. 2i) 

<400> 5 

tgtaaaacga cggccagtga attgcgcgca attaaccctc actaaaggga acaaagatgt 60 
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gtaactataa cggtcctaag gtagcgagtc gac 93 

<210> 6 
<211> 103 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> protein_bind 
<222> (1).. (6) 

<223> BamHI restriction site 
<220> 

<221> protein_bind 
<222> (7).. (45) 

<223> Pl-Scel homing endonuclease recognition site 
<220> 

<221> promoter 
<222> (53). . (72) 

<223> T7 polymerase binding site 
<220> 

<221> primer_bind 
<222> (83).. (103) 

<223> Reverse (Rev) primer binding site 
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<223> Description of Artificial Sequence: polylinker of 
the right arm of vector pFLC-1 1 1 (fig. 2i) 

<400> 6 

ggatcctgcc atttcattac ctctttctcc gcacccgaca tagatgcatc gcccctatag 60 
tgagtcgtat tacatagctg tttcctggaa attgttatcc get 103 

<210> 7 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primerjnnd 
<222> (1). . (18) 

<223> Forward (Fwd) primer binding site 
<220> 

<221> promoter 
<222> (27). . (46) 

<223> T3 polymerase binding site 
<220> 

<221> misc_recomb 

<222> (63).. (87) 

<223> attBl recombination site 

<220> 
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<221> protein_bind 

<222> (88).. (93) 

<223> Xhol restriction site 

<220> 

<221> proteinjnnd 

<222> (94).. (99) 

<223> Sail restriction site 

<220> 

<223> Description of Artificial Sequence: polylinker of 
the left arm of vector pFLC-DEST (fig. 2j) 

<400> 7 

tgtaaaacga cggccagtga gcgcgcaatt aaccctcact aaagggaaca aaagctggat 60 
caacaagttt gtacaaaaaa gcaggctctc gaggtcgac 99 

<210> 8 
<211> 117 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> protein_bind 
<222> (1).. (6) 

<223> BamHIi restriction site 
<220> 
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<221> misc_recomb 
<222> (7).. (30) 

<223> at tB2 recombination site 
<220> 

<221> promoter 
<222> (44).. (63) 

<223> T7 polymerase binding site 
<220> 

<221> primer_bind 
<222> (97).. (117) 

<223> Reverse (Rev) primer binding site 
<220> 

<223> Description of Artificial Sequence: polylinker of 
the right arm of the vector pFLC-DEST (fig. 2j) 

<400> 8 

ggatccaccc agctttcttg tacaaagtgg ttgatccaat tcgccctata gtgagtcgta 60 
ttacgcgcgc ttggcgtaat catggtcata gctgtttcct ggaaattgtt atccgct 117 

<210> 9 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

11/33 



WO 02/070720 



PCT/JP02/01667 



<223> Description of Artificial Sequence: linker/primer 
upper oligonucleotide 

<400> 9 

ctaggcgcgc cgagagatct agagagagag 30 

<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker/primer 
lower oligonucleotide 

<400> 10 

ctctctctct agatctctcg gcgc 24 

<210> 11 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amplification 
primer 
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<400> 11 

gagagactcg aggtcgacga gagaggcccg 
aattaact 



ggcggccgcg atcgcggccg gccagtcttt 60 

68 



<210> 12 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amplification 
primer 

<400> 12 

gagagaggat ccgagagagg ccagagaggc catttaaatg cccgggctgc aggaattcga 60 
tat 63 



<210> 13 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amplification 
primer 

<400> 13 
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gagagagcgg ccgcccgggc eatttaaatc cggcttacta aaagccaga 49 

<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amplification 
reverse primer 

<400> 14 

agcggataac aatttcacac agga 24 

<210> 15 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amplification 
primer 

<400> 15 

gagagaggcc tctctggcca ctagtctgca gactggctgt gtata 45 
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<210> 16 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amplification 
forward primer 

<400> 16 

tgtaaaacga cggccagt 18 



<210> 17 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amplification 
primer comprising the loxP site 

<400> 17 

gagagaggat ccagagagat aacttcgtat aatgtatgct atacgaagtt atgagagagg 60 
ccagagaggc catttaa 77 



<210> 18 
<211> 68 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amplification 
primer 

<400> 18 

gagagactcg aggtcgacga gagaggcccg ggcggccgcg atcgcggccg gccagtcttt 60 
aattaact 68 



<210> 19 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker/adapter 
upper oligonucleotide 

<400> 19 

gatcaggcca aatcggccga gctcgaattc g 31 



<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: 1 inker/ad apt e 
lower oligonucleotide 

<400> 20 

tcgagaattc gagctcggcc atttggcct 

<210> 21 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 1 inker/adapte 
upper oligonucleotide 

<400> 21 

gatcaggccc ttatggccgg atccactagt gcggccgca 

<210> 22 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 1 inker/adapte 
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lower oligonucleotide 
<400> 22 

tcgatgcggc cgcctagtgg atccggccat aagggcct 38 

<210> 23 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR T7 Rev 
primer 

<400> 23 

gtgtgatatc gccctatagt gagtcgtatt acatagctgt ttcctgtgtg aaattg 56 

<210> 24 
<211> 70 
<212> DNA 
• <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PCR T3 Fwd 
primer 

<400> 24 
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gagagatatc tttgttccct ttagtgaggg ttaattgcgc gcaattcact ggccgtcgtt 60 
ttacaacgtc 7° 

<210> 25 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 25 

gagagactcg aggtcgacga gagaggcccg ggcggccgcg atcgcggccg gccagtcttt 60 
aattaact 68 

<210> 26 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 26 

gagagaggat ccgagagagg ccagagaggc catttaaatg cccgggctgc aggaattcga 60 
tat 63 
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<210> 27 . 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 27 

gtgtaactat aacggtccta aggtagcgag tcgacgagag aggcccgggc ggccgcgat 59 



<210> 28 
<211> 67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 28 

gcatctatgt cgggtgcgga gaaagaggta atgaaatggc aggatccgag agaggccaga 60 
gaggcca 67 



<210> 29 
<211> 69 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 29 

gagagtctag ataacttcgt atagcataca ttatacgaag ttataaatca atctaaagta 60. 
tatatgagt 69 



<210> 30 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 30 

gagagtctag ataacttcgt ataatgtatg ctatacgaag ttataaaact tcatttttaa 60 
tttaaaagg 69 



<210> 31 
<211> 76 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: AttBl linker 
upper oligonucleotide 

<400> 31 

cgggccacaa gtttgtacaa aaaagcaggc tctcgaggtc gacgagaggc cagagaggcc 60 
ggccgagatt aattaa 76 

<210> 32 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: AttBl linker 
lower oligonucleoide 

<400> 32 

ttaattaatc tcggccggcc tctctggcct ctcgtcgacc tcgagagcct gcttttttgt 60 
acaaacttgt ggcccggtac 80 

<210> 33 

<211> 78 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: At tB2 linker 
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upper oligonucleotide 
<400> 33 

ggccatgacg gccgagagat ttaaatgaga gaggatccac ccagctttct tgtacaaagt 60 
ggtctagacc tctcttgg 78 

<210> 34 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: At tB2 linker 
lower oligonucleotide 

<400> 34 

gaggtctaga ccactttgta caagaaagct gggtggatcc tctctcattt aaatctcttg 60 
gccgtcatgg cc 72 

<210> 35 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LoxP linker 
upper oligonucleotide 
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<400> 35 

ccgcataact tcgtatagca tacattatac gaagttatgc 40 

<210> 36 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LoxP linker 
lower oligonucleotide 

<400> 36 

ggccgcataa cttcgtataa tgtatgctat acgaagttat gcggccaaga 50 

<210> 37 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
junction linker upper oligonucleotide 



<400> 37 
ggccatgaga t 
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<210> 38 
<211> 11 
<212> DNA 

<213> Artificial Sequence ■ 
<220> 

<223> Description of Artificial Sequence: Plasmid 
junction linker lower oligonucleotide 

<400> 38 

ctagatctca t 11 

<210> 39 
<211> 93 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: I -Ceu I/PI -See I adaptor 
oligonucleotide (up adaptor strand) 

<400> 39 

cgcgctaact ataaeggtec taaggtagcg agtcgacgag agagagagga tccatctatg 60 
tegggtgegg agaaagaggt aatgaaatgg cag 93 
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<210> 40 
<211> 93 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: I-CeuI/PI-Scel adaptor 
oligonucleotide (down adaptor strand) 

<400> 40 

cgcgctgcca tttcattacc tctttctccg cacccgacat agatggatcc gagagagaga 60 
gtcgactcgc taccttagga ccgttatagt tag 93 



<210> 41 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Xbal - LoxP Tag primer 3F 
<400> 41 

gagagtctag ataacttcgt atagcataca ttatacgaagt tataaatca atctaaagta 60 
tatatgagt 69 
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<210> 42 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Xbal - LoxP Tag primer 3R 
<400> 42 

gagagtctag ataacttcgt ataatgtatg ctatacgaag ttataaaact tcatttttaa 60 
tttaaaagg 69 



<210> 43 
<211> 520 

<212> DNA (genomic) 

<213> 

<220> 

<223> Description of the artificial sequence: oriV sequence 
<400> 43 

ccggcgttgt ggataccacg cggaaaactt ggccctcact gacagatgag gggcggacgt 60 
tgacacttga ggggccgact cacccggcgc ggcgttgaca gatgaggggc aggctcgatt 120 
tcggccggcg acgtggagct ggccagcctc gcaaatcggc gaaaacgcct gattttacgc 180 
gagtttccca cagatgatgt ggacaagcct ggggataagt gccctgcggt attgacactt 240 
gaggggcgcg actactgaca gatgaggggc gcgatccttg acacttgagg ggcagagtga 300 
tgacagatga ggggcgcacc tattgacatt tgaggggctg tccacaggca gaaaatccag 360 
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catttgcaag ggtttccgcc cgtttttcgg ccaccgctaa cctgtctttt aacctgcttt 420 
taaaccaata tttataaacc ttgtttttaa ccagggctgc gccctggcgc gtgaccgcgc 480 
acgccgaagg ggggtgcccc cccttctcga accctcccgg 520 

<210> 44 
<211> 34 

<212> DNA (genomic) 

<213> 

<220> 

<223> Description of the artificial sequence: yeast FRT element 
<400> 44 

gaagttccta ttctctagaa agtataggaa cttc 34 



<210> 45 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: double strand 
oligonucleotide adaptor upper strand: 
phosphorylated at the 5' end 

<400> 45 

tcgaagcttc eg 12 
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<210> 46 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: double strand 
oligonucleotide adaptor lower strand 

<400> 46 

cgcgcggaag ct 12 



<210> 47 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 47 

agagagagag atctagaata acttcgtata atgtatgcta tacgaagtta tctgtcaaac 60 
atgagaattg 70 
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<210> 48 
<211> 67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 48 

gagagagaga tctagataac ttcgtatagc atacattata cgaagttatc gaatttctgc 60 
cattcat 67 



<210> 49 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sau3AI/XbaI adaptor 

upper strand 

<400> 49 

gagagagaga tctagaaagc tcca 34 



<210> 50 
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<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sau3AI/XbaI adaptor 

lower strand 

<400> 50 

gatctggagc tt 12 



<210> 51 
<211> 3003 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pFLC-II 
<400> 51 

ggatccaccc agctttcttg tacaaagtgg tctagacctc tcttggccgc ataacttcgt 60 
atagcataca ttatacgaag ttatgcggcc gccaccgcgg tggagctcca gcttttgttc 120 
cctttagtga gggttaattg cgcgcttggc gtaatcatgg tcatagctgt ttcctgtgtg 180 
aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa agtgtaaagc 240 
ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac tgcccgcttt 300 
ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggggagagg 360 
cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt 420 
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tcggctgcgg cgagcggtat cagctcactc 
aggggataac gcaggaaaga acatgtgagc 
aaaggccgcg ttgctggcgt ttttccatag 
tcgacgctca agtcagaggt ggcgaaaccc 
ccctggaagc tccctcgtgc gctctcctgt 
cgcctttctc ccttcgggaa gcgtggcgct 
ttcggtgtag gtcgttcgct ccaagctggg 
ccgctgcgcc ttatccggta aetatcgtct 
gccactggca gcagccactg gtaacaggat 
agagttcttg aagtggtggc ctaactacgg 
cgctctgctg aagccagita ccttcggaaa 
aaccaccgct ggtagcggtg gtttttttgt 
aggatctcaa gaagatcctt tgatcttttc 
ctcacgttaa gggattttgg tcatgagatt 
aaattaaaaa tgaagtttta aatcaatcta 
ttaccaatgc ttaatcagtg aggcacctat 
agttgcctga ctccccgicg tgtagataac 
cagtgctgca atgataccgc gagacccacg 
ccagccagcc ggaagggccg agcgcagaag 
gtctattaat tgttgccggg aagctagagt 
cgitgttgcc attgctacag gc&tcgiggi 
cagctccggt tcccaacgat caaggcgagt 
ggttagctcc ttcggtcctc cgatcgtigi 
catggttatg gcagcactgc ataattctct 
tgtgactggt gagtactcaa ccaagtcatt 
ctcttgcccg gcgtcaatac gggataatac 
catcattgga aaacgttctt cggggcgaaa 
cagttcgatg taacccactc gtgcacccaa 
cgtttctggg tgagcaaaaa caggaaggca 



aaaggcggta atacggttat ccacagaatc 480 

aaaaggccag caaaaggcca ggaaccgtaa 540 

gctccgcccc cctgacgagc atcacaaaaa 600 

gacaggacta taaagatacc aggcgtttcc 660 

tccgaccctg ccgcttaccg gatacctgtc 720 

ttctcatagc tcacgctgta ggtatctcag 780 

ctgtgtgcac gaaccccccg ttcagcccga 840 

tgagtccaac ccggtaagac acgacttatc 900 

tagcagagcg aggtatgtag gcggtgctac 960 

ctacactaga aggacagtat ttggtatctg 1020 

aagagttggt agctcttgat ccggcaaaca 1080 

ttgcaagcag cagattacgc gcagaaaaaa 1140 

tacggggtct gacgctcagt ggaacgaaaa 1200 

atcaaaaagg atcttcacct agatcctttt 1260 

aagtatatat gagtaaactt ggtctgacag 1320 

ctcagcgatc tgtctatttc gttcatccat 1380 

tacgatacgg gagggcttac catctggccc 1440 

ctcaccggct ccagatttat cagcaataaa 1500 

tggtcctgca actttatccg cctccatcca 1560 

aagtagttcg ccagttaata gtttgcgcaa 1620 

gtcacgctcg tcgtttggta tggcttcatt 1680 

tacatgatcc cccatgttgt gcaaaaaagc 1740 

cagaagtaag ttggccgcag tgttatcact 1800 

tactgtcatg ccatccgtaa gatgcttttc 1860 

ctgagaatag tgtatgcggc gaccgagttg 1920 

cgcgccacat agcagaactt taaaagtgct 1980 

actctcaagg atcttaccgc tgttgagatc 2040 

ctgatcttca gcatctttta ctttcaccag 2100 

aaatgccgca aaaaagggaa taagggcgac 2160 
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acggaaatgt tgaatactca tactcttcct 
ttattgtctc atgagcggat acatatttga 
tccgcgcaca tttccccgaa aagtgccacc 
ttcgcgttaa atttttgtta aatcagctca 
atcccttata aatcaaaaga atagaccgag 
aagagtccac tattaaagaa cgtggactcc 
ggcgatggcc cactacgtga accatcaccc 
aaagcactaa atcggaaccc taaagggagc 
gcgaacgtgg cgagaaagga agggaagaaa 
agtgtagcgg tcacgctgcg cgtaaccacc 
ggcgcgtccc attcgccatt caggctgcgc 
tcttcgctat tacgccagct ggcgaaaggg 
acgccagggt tttcccagtc acgacgttgt 
gactcactat agggcgaatt gggtaccggg 
gag 



ttttcaatat tattgaagca tttatcaggg 2220 
atgtatttag aaaaataaac aaataggggt 2280 
taaattgtaa gcgttaatat tttgttaaaa 2340 
ttttttaacc aataggccga aatcggcaaa 2400 
atagggttga gtgttgttcc agtttggaac 2460 
aacgtcaaag ggcgaaaaac cgtctatcag 2520 
taatcaagtt ttttggggtc gaggtgccgt 2580 
ccccgattta gagcttgacg gggaaagccg 2640 
gcgaaaggag cgggcgctag ggcgctggca 2700 
acacccgccg cgcttaatgc gccgctacag 2760 
aactgttggg aagggcgatc ggtgcgggcc 2820 
ggatgtgctg caaggcgatt aagttgggta 2880 
aaaacgacgg ccagtgagcg cgcgtaatac 2940 
ccacaagttt gtacaaaaaa gcaggctctc 3000 
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